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PREFACE 


A way of understanding the laws which govern the world of elementary 
particles has not been found yet. Present-day theoretical physicists have 
to be satisfied with compromises which, at the best, promise some success 
at the expense of generality and unity. 

Under these circumstances a critical analysis of the basic concepts of 
modern quantum theory may be timely and useful. It is hoped that the 
value of such an analysis may be preserved even if, in the near future, 
new ways of understanding the basis of elementary particle physics 
are discovered. In this monograph one specific aspect of this analysis 
is treated, namely the problems of geometry in the microworld. An out- 
line of geometrical measurements in the macroworld was given pre- 
viously. These measurements seem to be clear enough for at least a 
certain set of problems to be considered as a starting point for discussing 
the situation in the microworld. The concepts and methods which are 
useful in the macroworld may only indirectly be carried over into the 
microworld and they require a high degree of abstraction. 

In comprehending the physical content of dynamic variables which 
have geometric meaning, for example, the space-time particle coor- 
dinates x, y, z, £ it is often necessary to have recourse to gedanken 
experiments which, although not feasible in practice, can nevertheless 
be compatible with the basic principles of geometry and quantum 
mechanics. In a desert sea of abstract constructions there is a still larger 
distance between macroscopic concepts of space-time and the way of em- 
ploying the coordinates x, y, z and ¢ in relativistic quantum field theory. 

It is shown in this monograph that if elementary particles have a 
structure it is doubtful whether the coordinates of elementary particles, 
x, y, Z, t, can even be defined exactly, let alone the coordinates of the 
elements which make up these particles (if they do not exist only in our 
imagination). This important fact is revealed in even the most favourable 
gedanken experiments. 

From this fact, doubt arises about the logical validity of using the 
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symbols x, y, z, t as the space-time coordinates to describe phenomena 
inside elementary particles. This allows theoreticians a certain freedom 
of choice of space-time and causal relationships within elementary 
particles; in other words, an arbitrariness of choice of the geometry in 
the small. 

The last chapters of this book describe some models used to illustrate 
the situation described above. In concluding, experimental data and ex- 
perimental possibilities relating to geometric and causal problems in 
the microworld are discussed. 

Since the appearance of the Russian edition in 1970 the situation in 
the experimental physics of elementary particles has not changed no- 
ticeably as regards the problems this book deals with. The derivation 
of asymptotic theorems (§30, §47) has been found to be less limiting 
with respect to the problem of nonlocality than was thought at that time. 
The assertion that no discrepancy with local theory predictions up to 
scales of the order 10715 is observed still seems to be valid. 

The present monograph is based to a great extent on the author’s 
publications at different times. A large number of papers have been 
prepared jointly with his young colleagues at the Laboratory of Theoret- 
ical Physics at Dubna, B.M. Barbashov, N.A.Chernikov, G.I. Kolerov, 
G.V.Efimov and V.G.Kadyshevsky. The author was very glad to take 
the opportunity of reflecting in this monograph some of their original in- 
vestigations. The author is very grateful to his colleagues for useful dis- 
cussions which have greatly contributed to clarifying many controversial 
questions. The author is especially grateful to M.A. Markov and B.M. 
Barbashov for their invaluable advice and comments and for fulfilling 
the task of critically reading the manuscript, and to G.I. Kolerov for his 
assistance in preparing it. 

The author is greatly indebted to Reidel Publishing Company, thanks 
to whom the ideas of this work were, not for the first time, made available 
to a large number of physicists. 

The assistance of Mrs I.S. Zarubina, collaborator of the Joint Institute 
for Nuclear Research, in editing the English version is also appreciated. 

I hope that this book may prove helpful to all those who are concerned 
with fundamental problems of modern atomic physics. 


February 1972 Dubna — Moscow D. BLOKHINTSEV 
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INTRODUCTION 


The interpretation of Geometry presented here must not be applied 
directly to submolecular spaces... such an extrapolation might 
prove to be as incorrect as applying the concept of temperature to 
bodies of molecular dimensions.* 


A. EINSTEIN 


In modern philosophy, space and time are thought of as forms of exist- 
ence of some material. Thus it is natural to consider that the laws of 
geometry are the most general and that their region of applicability 
covers all events and phenomena in the world which we know. We shall 
not define the term ‘event’ exactly here, but take it to mean some physical 
phenomenon which is expressed in some change in material state. 

Since the time of Einstein and Minkowski [2, 3], geometry has united 
investigations of space and time and thus the concept of motion is firmly 
embedded in modern geometry. Actually this concept, together with the 
concept of time, was always implicitly included in geometry. One of the 
basic operations in Euclidean geometry, the matching of geometric 
figures, introduces the concept of displacement, or motion, indirectly into 
geometry. In the well-known “Erlangen program” [4], Klein introduced 
the group approach into geometry, which can also be considered as an 
introduction of the concept of motion into geometry. 

Putting aside this aspect of the question, we can make the most 
general assertion that geometry entails the ordering of events. Each event 
will be issued a “passport” giving its “place of residence”. The united 
geometry of space and time allows us to consider the neighborhoods of 
events in both space and time. Thus it becomes possible to base the 
order of events on causality, i.e., on the genetic relationship between 
events. 

Present-day experience shows that such an approach is possible and 
is based on laws that are inherent in our world. But we do not, strictly 


t See [1]. 
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speaking, know the limits of this approach. The world of very large scales, 
cosmology, and that of very small scales, the world of elementary particles, 
have not been sufficiently well investigated for us to know if causality has 
any meaning there. 

Weare used to considering x as the spatial coordinate and ¢ as the time 
coordinate and using them as such even inside of elementary particles. 
However, we do not know what they represent in reality. Should not the 
space-time relationships change radically in such small scale regions as 
those of the elementary particle world? 

The problems of space and time in the microworld are the topic of this 
book. 


CHAPTER I 


GEOMETRICAL MEASUREMENTS IN 
THE MACROWORLD 


1. THE ARITHMETIZATION OF SPACE-TIME 


For the purpose of ordering events, we assume that it is possible to 
assign to each elementary event P certain fixed numbers x,, X2,..., X,- 
coordinates of the event. The number of necessary coordinates n is deter- 
mined by the number of dimensions of the space-time manifold being 
considered. (For example, physical space is four dimensional; there are 
three spatial coordinates x, =x, x3 =y, X, =z, and the fourth coordinate, 
X4=t, is time.) The set of these n numbers (x)= x,, X2,..., X, constitutes 
a description or “passport” of the elementary event. 

By an elementary event P, we mean an event to which it is possible 
to assign only one set of numbers (x). Therefore, the term “elementary 
event P”, and “point P” are equivalent in our space-time. We shall call 
this process of assigning coordinates to each event arithmetization of 
points in space-time, in order not to use the word passportization of 
points in space-time, which is not popular in science. Some mathematical 
(logical) requirements have to be given for the process of arithmetization 
in space-time, for example, the requirement of a one-to-one correspon- 
dence of points in space-time P to the coordinates (x). In addition, the 
physical methods have to be shown, at least in principle, by which this 
suggested arithmetization can be carried out. In other words, we have 
to give the physical processes on which the methods of arithmetization 
can be based. 

Points in empty space do not, by the very definition of the term 
“empty”, differ from each other in any way.‘ Therefore, a real, physical 
method of differentiating between points in space-time may be based only 
on real events at these points. In other words, it is possible to arithmetize 
real events, but not points in space-time themselves. By accepting the 


t We are not discussing here the question of whether it is possible to consider real, physical 
space as empty. 
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equivalence of elementary events and points in space-time, we exclude 
from the very beginning undefined operations in a vacuum. 

In this manner arithmetization of points in space-time can be based 
on methods of differentiating between the points which are independent 
of arithmetization itself. The point, equivalent to an elementary event, 
has to have individual characteristics, which delineate the situation in 
which the event will occur. In this formulation of the problem, arith- 
metization is still rather arbitrary. The nature of the arbitrariness may 
be seen from the following example. Let the space under consideration 
be filled with material whose color changes continuously (see the drawing 
on the cover). Each point contains a certain quantity of blue and yellow 
color. We can compare every combination of colors, to the extent to 
which the color of each point is different, with any possible pair of 
numbers x,, x only if the set of numbers x, and x, is as dense as the 
set of color points. The great arbitrariness of such a comparison may 
be seen from the fact that we should, in going from one color region to 
another, assign any value to the size of the steps between numbers 
X,, X2 without affecting the one-to-one correspondence between x,, x2 
and the color points or “elementary events”. 

Let us now assume that we have a method available for measuring 
the concentration of the blue and yellow color at any given point, letting 
that of blue be a, and that of yellow be b. Then the concentration could 
be taken to be a pair of numbers (a, b) for the arithmetization of our 
events. Such arithmetization would directly reflect the structure of our 
manifold and the continuous character of the measurements of the color 
points. The choice of any other system of arithmetization such that 
x,=X,(a, b) and x,=X,(a, b), [we assume a one-to-one relationship 
between the sets of points (x,, x2) and (a, b)], would have a firm physical 
basis in a “prefered” or “natural” system of arithmetization (a, b). 

The “preferedness” or “naturalness” is based on the existance of a 
physical method of defining the color composition of the “event”, 
which is not intrinsically related to arithmetization. 

When selecting another system of arithmetization (x,, x2), parti- 
cularly a discontinuous one, we have to remember in the analysis of events 
in our color space that jumps in the system of passportization of points are 
artificial. In the same way it would be artificial, for example, to in- 
troduce on the same street several numbering systems for the houses so 
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that the number of each house does not follow that of its neighbor, but 
is different and arbitrary. 

This simple example shows that the method of arithmetization of 
points in space-time has to be based on real processes, and has to represent 
objectively the structure of space-time in the most direct manner possible. 
Arithmetization is not possible without using material processes, and 
therefore it is not possible to separate physics from geometry [5, 6]. 


2. THE PHYSICAL METHODS OF ARITHMETIZATION 
OF SPACE-TIME 


Before Einstein, theoretical physics was based on Euclidean geometry 
and on the concept of some absolute time. This formed the basis on which 
any occurrance could be described. The idea that material phenomena 
could be associated with the geometry of space and time was foreign to 
physicists of that period. 

Newton defined space and time in his Mathematical Principles of 
Natural Philosophy thus: 


Absolute, true, and mathematical time, of itself, and from its own nature, flows equably 
without relation to anything external, and by another name is called duration .... Absolute 
space, in its own nature, without relation to anything external, remains always similar and 
immovable. [7] 


The complete independence of geometry of physics is stressed in these 
definitions. However, as was first shown by Helmholtz [8], Euclidean 
geometry is based on the concept of the existance of rigid bodies. The 
basic assumptions are: 

(i) N-dimensional space is an n-fold extended manifold ,,(x). In other 
words each element of the manifold (point) is defined by n con- 
tinuously varying quantities — the coordinates of the element: 
Kip Nea sens Noge 

(ii) The existence of moving, but unalterable (rigid) bodies or systems 
of points is assumed. Between the 2n coordinates of each pair of 
points that belong to a rigid body there exists an equation that is 
independent of the motion of the bodies and is the same for all 
mutually coincident pairs of points. 

(iii) Complete independence of motion of rigid bodies is assumed. It is 
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assumed that each point can continuously change from one to 
another in so far as this is not prevented by the bond imposed by 
assumption (ii). 
(iv) Two superimposed bodies remain superimposed even after one of 
them undergoes a rotation about some axis (uniqueness). 
Considering these statements for the case where n=3 and, assuming 
infinite space, Helmholtz showed that the quantity 


ds? = Yo dx? (2.1) 


remains constant under all rotations of a rigid system about the point 
dx;=0. Therefore, this quantity may be taken as a measure of distance 
in space — of points x,, X2, x3 and xy +dx,, x, +dx, x3 +dx3.t 

In the conclusion of his research, Helmholtz stressed 


that the possibility of our making spatial measurements depends, as the preceeding has 
shown, on the existence in nature of bodies which approximate sufficiently closely our con- 
cept of rigid bodies. The independence of compatibility from location and direction of coin- 
cident forms, and on the path by which they are brought into coincidence, is the fact on 
which the possibility of making measurements in space is based. 


A modern summary of Helmholtz’s research may be given by a state- 
ment that Euclidean geometry is based on group motion of rigid bodies. 
This group contains six continuous parameters, including both oper- 
ations of rotation 03, and of translation 7. Fa 

Instead of motion of rigid bodies, we may consider the transformation 
of points in the Euclidean manifold itself, 

x’ = Ax, (2.2) 
where A is some linear operation on the coordinates x(x,, x2, x3) which 
keeps the geometry of a rigid body unchanged and transforms the point 
P(x) of the manifold R(x) to the point P(x’) of the manifold R(x’). 
This wider point of view allows us to bring into consideration, not only 
continuous transformation corresponding to motion, but also discrete 
changes in coordinates. In particular, the inverse transformation changes 
the sign of the coordinates 


x=-x. (2.3) 


t Helmholtz’s result leads to the Riemann distance metric. 
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All such operations retain the invariant fundamental] form 
L(x, x!) = (X1 = x4)? + (x2 = 2)? + (x3 = 5)’, (2.4) 


where (x4, X2, x3) and (x1, x2, x3) are the coordinates of two points, 
P(x) and P(x’). This form is the same as the square of the distance 
between the points P(x) and P(x’). 

From this discussion of the ideas concerning the nature of Euclidean 
geometry, we see that in the basic definition of measure, an ideally rigid 
indexed scale may be used. This ancient method of measuring distances 
still has great practical value in our time. 

Arithmetization of points in space may now be carried out with the 
help of operations in which the rigid scale is plotted. The occurrence here 
or there of some other index on the scale is in itself a material event, and 
in this case the process of arithmetization of points in empty space auto- 
matically follows the filling of the space with material. A point in space 
P, with coordinates x [we will often denote such points with the symbol 
P(x)], will indicate those points which mark the position of the end of the 
single rigid scale /, if it has been plotted x times. The rigidity of the 
scale also assumes that it does not vary with time. Therefore, if the rigid 
rules is plotted again at some later time ¢, and if this is repeated a 
further x times, then by definition we obtain the identical point in space 
P(x). 

We note that there exists an elegant possibility of verifying the 
Euclidean character of n-dimensional space by measuring with the aid 
of the rigid scale an n-dimensional tetrahedron stretched out to n+1 
points which are its vertices. 

Let us take one of these points for the origin and assign coordinates 
x! to the remaining points, where i= 1, 2,..., n (the number of points) and 
a=1,2,..., (the number of coordinates). The volume of such a poly- 
hedron V,, is expressed in the form 


xt xh..x) 
1] x? x3... x? 
A Seay, (2.5) 


n n 
Xi Xle Xn 


Interchanging rows and columns and multiplying (2.5) the resultant we 
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obtain 
(xix!)  (x'x?)... (x!x") 
2_ 1 | œx) (x?x?)... (x?x") 
eo ao ae ae ee 


(2.6) 


The general term of the determinant has the form 


XaXa 


Me. 


dy = xxt = $f (x! — x} — (x)? —(x49?]. 27 


a 


1 


In other words, the volume J, is a function only of the distances between 
the vertices of the polyhedron taken in pairs. For n-dimensional Eucli- 
dean space, V,,,=0. If V4, #0 for this space, this would mean that the 
geometry of the space is non-Euclidean. 

In order to carry out an exhaustive arithmetization of events, it is 
necessary to have a standard time as well as a standard scale length. 
Such a time must also be “rigid”, i.e., it must have a constant period t. 
Thus the interval of time ¢ will denote the number of measured periods z, 
which we consider as unity. This assumed “rigidity” of time is mathe- 
matically equivalent to uniformity of time, to which the previously dis- 
cussed group of transformations in Euclidean space has to be supple- 
mented by the operation of translation in Time 7,, which retains invariant 
the length of the time interval r=1,—1,. Thus the general group of 
transformations, “characteristic” of the geometry of Euclid and Newton, 
may be written as the product of the operations 


g=0,1;7,P. (2.8) 


The treatment of higher transformations is related to the specific frame 
of reference, by which we mean the ensemble of rigid bodies and rigid 
times, which are motionless with respect to each other, and which serve 
for the arithmetization of points in space and time. 

It is possible to imagine another system moving with a uniform 
velocity v with respect to the other. Let us set the direction of this velocity 
along the axis Ox. Then the transformation of the coordinates and time 
may be written as 


x, =x, + vt, X3 = Xz, X3 = X3, St; (2.9) 
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which is the Galilean transformation. Strictly speaking, this already goes 
beyond the bounds of geometry because it operates separately in space 
R, (x) and time T(t). The meaning of this transformation is revealed if 
we go over to mechanics. 
The Galilean transformations retain the invariance of acceleration, 
d?x’ d?x’ d?x 
a a? a” eae 


and, therefore, the basic law of mechanics - Newton’s second law — 
remains unchanged. This invariance is expressed in the Galilean principle 
of relativity; all mechanical phenomena occur in the same manner in all 
inertial frames of reference. 

Let us now formally combine the space R;(x) and time T(t) in one 
general manifold ®,(x, t). By denoting the Galilean transformation 
operation (2.9) by g3, we may write a more general operation in the 
manifold space-time R(x, t) as the product of the operations 


§ = 9303737. (2.11) 


This is, strictly, the Galilean transformation. It may be extended to an 
improper transformation, if the operation g is extended by additional 
operations of the inverse P of the coordinates and operations of the 
reversed time 7. This last operation denotes the changes 


t=—¢. (2.12) 
Thus the general operation of improper Galilean transformation is 
written in the form 


Ou = GPT. (2.13) 


In conclusion, we shall make a few remarks concerning the selection 
of a coordinate system. In considering Euclidean geometry, we used the 
Cartesian coordinate system x,, X2, X3, in which the fundamental form 
L(x, x’) has the form (2.4). In few coordinate systems are the inner 
characteristics expressed in a natural manner. A number of paradoxes in 
the general theory of relativity are due to the fact that natural phenomena 
do not differ sufficiently from “‘~pseudophenomena’”’ which are caused by 
the sometimes unfortunate choice of the coordinate system, or in general, 
by the method of arithmetization of events. 
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To explain this, let us consider a method of arithmetization of points 
in space with measurements made using a stretched elastic thread so 
that the displacement x will be expressed in units of force fof the stretched 
thread. Insofar as Hooke’s Law applies to small extensions, 


x= af (2.14) 


where « is a coefficient. Since changes in geometrical lengths are pro- 
portional to the applied forces, all Euclidean geometrical relationships 
are preserved. However, it was found experimentally that Hooke’s Law is 
applicable only up to a certain limiting value of f, beyond which the ex- 
pression for the displacement takes on the form 


x=o(f)=af + Bf? +, (2.15) 
or 
f =x), (2.15’) 


where g ! is the inverse of the function g. 

The relationship between the circumference and the diameter of a 
circle would now be expressed as 

Jere p = (27r ) 
= — E x Tl. 2.16 

Saiam p 1 (D) ( ) 
The number z; would depend on the dimensions of the ring. Similarly 
Pythagoras’ theorem would become more complicated and have the 
form 





Tip 


San + Sec # fac- (2.17) 
The well-known inequality or the sides of a triangle ABC, 
AB + BC > CA, (2.18) 


would also not be satisfied for triangles whose longest side is C4, because, 
as the extension x = CA increases, the force of tensile strength fç 4 becomes 
relatively smaller. (The length of the thread tends to infinity for ter- 
minal values of f.) 

Let us consider how uniform motion would appear if we assume that 
we have a good, rigid clock. We obtain 


d?f = (dx\* d?@7~! (x) 
— = 0 
dt? (=) dx? 





(2.19) 
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In other words, a uniformly moving body would appear to be slowing 
down. 

This method of arithmetization of space, using a stretched elastic 
thread, is not in itself meaningless and should not be rejected. In fact, 
because of the one-to-one correspondence between the displacement x 
and the force f, it is one of the allowed methods of ordering points in 
space. Moreover, on small scales, in regions where Hooke’s law applies, 
this method is equivalent to the usual one, of measurements made with a 
rigid scale. Why, then, must we nevertheless prefer the use of a rigid 
measuring rod? 

The answer to this question may be given by comparing (a) the degree 
of universality of the method and (b) the degree of generality of the 
regularities which are revealed by some or other method of arith- 
metization of space. 

Let us first consider point (a). All real rods depart in various ways from 
the ideal “absolute rigidity’. However, there exists a vast domain of 
scales and temperatures in which these deviations are not large and may 
be accounted for by suitable corrections. 

Real elastic thread also deviates in various ways from ideal elastic 
thread, which obeys Hooke’s law strictly, but these deviations can not 
be considered in the form of corrections because they significantly 
affect the result of the measurements. Thus the generality of a ‘““dynamo- 
meter” as an instrument for defining lengths is rather small. Each dy- 
namometer would introduce its individual characteristics into the 
space-time relationship of events. 

The second point (b) also does not speak in favor of measuring 
distances with a stretched elastic thread. If we accepted this method, we 
would determine that small circles have constant relationships and large 
circles have various relationships [see (2.16)]. The additivity of segments, 
generally speaking, would not hold; AB + BC#AC. For some triangles, 
the inequality of their sides would be fulfilled, and for others it would 
have no meaning. A body, not subjected to a real force, would slow 
down, etc. This is what a world would look like to us in which geometrical 
measurements are made with a stretched thread and a dynamometer. 

If we were to cross over at an opportune time to arithmetization 
with a rigid scale, we would find the well-known laws of Euclidean 
geometry and the law of inertia. It would appear that, for all circum- 
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ferences 2xr/D=T7, for all triangles AB + BC > CA, for all bodies that are 
not subjected to a force, d?x/dt? =0, etc. The laws that applied only to 
small scales before, f < fo [fo being the value of the extension for which 
the function (f) begins to differ noticably from af where «=constant], 
in the new system of arithmetization would extend by themselves over 
the whole region of importance of x and become universal. 

Thus not all systems of arithmetization of space and time exhibit 
internal consistency in the space-time relationship. Amongst the possible 
systems and methods of arithmetization there exist methods possessing 
the maximum universality and the maximum capability of exhibiting the 
most general regularities in the interrelationship of events. 


3. ON DIVIDING THE MANIFOLD OF EVENTS 
INTO SPACE AND TIME 


If some event P is able to exert influence on a second event P’, it is possible 
to use such an event as a signal propagating from the point P to the point 
P’ in the manifold of events ®,,(P).* 

The set of events P’, on which the events originating in P are able to 
exert influence, forms the manifold T,(P, P^). We shall call this mani- 
fold the absolute future, with respect to the point P. The set of events 
P’ that are able to influence the events in P form the manifold T_(P, P’). 
We shall call this manifold the absolute past with respect to the point P. 
If an ordering of causes and events occurs, then the domains T, and T_ 
do not have common points except for the one point P: 


Tras =. (3.1) 
The manifold formed by the union of the domains T, and T_, 
T, u T= T(P, P), (3.2) 


will be called the time domain, with respect to the point P. We shall call 
the remaining part of the manifold 


S (P, P’) = R(P)— T(P, Ph, (3.3) 
* R,(P) denotes an n-dimensional manifold of events: R,(x) denotes a manifold after 


arithmetization — 1.€., a coordinate (x) ıs assigned to each point P This will sometimes be 
stressed by the symbol P(x). Obviously R,,(P(x)) =, (x). 
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the space domain or, more simply, space, with respect to the point P. 
In order to make this process of dividing manifold ®,,(P) into space 
S(P) and time T(P) more specific, we assume that physical events have 
meaning in some scalar field p=@(P) that satisfies the second-order 
differential equation in the manifold ®,,(P) of the form 
2 


REO: 3.4 
= 0x,0Xp 2 oa 


where the coefficients A,, and R in general are functions of the field g and 
its first derivatives y, =0q/0x,, Q is an external source of the field and 
depends only on the point P; and a, B=1, 2,..., n. 

We shall now consider propagation of the signal subject to Equation 
(3.4). By a signal, we mean the wave whose front is defined by the func- 
tion 

S = S(x,, X2,..., X,) = const, (3.5) 


A discontinuity of the first derivative occurs at the boundary at the front 
(see Figure 1). Such a discontinuity is called weak. From the theory 








Av 
! 
l 
| 
| 
| 
Loene front 
> X 
Fig. | A weak discontinuity on a wave front surface. 


of differential equations [9, 10] it is well known that the equation for the 
surface of such a discontinuity is completely defined by the coefficients 
of the second derivatives of Equation (3.4) 


ðS aS _ 


pha he 3.6 
** Ax, OX, 69) 


To an accuracy of the order of (1/œw), this equation agrees with the equa- 
tion for the phase © of high frequency, œw, waves 


p =ae??, woo. (3.7) 
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Substituting (3.7) in (3.4) for woo gives the equation 


OD öp 1 
—— + 0(—)=0. 3.6’ 
* Ax, OX, Š (5) ee) 


Equation (3.6) is an equation from geometrical optics. The vector N, 
which is normal to the surface 
= @(x,, X2, ..., Xn) = const (3.5’) 
agrees with the direction of the radial lines and has components 
op 
N. 


a~ 
OX, 





Let us take a real random vector €(€,, €5,...,€,) and generate the 
quadratic form 


F(é, č) = A,poaSp $ (3.8) 
If this form is definite, 
F(é,é)>0 or F(é,é) <0 (3.9) 


for any č, then the equation for the normal vector N 
F(N, N) =0 (3.10) 


will have imaginary roots and, therefore, there will be no propagation 
of the field ọ in the manifold ,, (x). In this case the events form a motion- 
less world. If we forcibly vary the field ọ, for example by varying the 
source field Q(x) in the vicinity of some point P, so that 


ðQ (x) =e6* (x — xp), (3.11) 


where e is a small number, then the state of the field will change in all 
manifolds ,(x). In this case Equation (3.4) would become an elliptic 
function [9], and no separate manifold would arise within ,,(x). 
Let us consider the second case, when the form of ®,,(x) is indefinite, 
namely, 
F(é, 2) 50 3.9) 


depending on the domain of values of the variables č. In this case, which 
is called hyperbolic, the actual directions N satisfy Eq. (3.6). The ensemble 
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of these directions forms a characteristic cone. This cone (see Figure 2) 
divides the manifold ,(x) about each point P into two regions: 
T(T=T,+I_) and ©. Region T, may be reached by any signal 
originating from P and having a velocity of propagation smaller than 
the signal velocity v, that is characteristic of the field @. Similarly, the 
region T_ contains points P’, from which signals can reach the points 
P if their velocities v<v,. The region S contains points which are 
reached by signals from P and which have a velocity v >v. 











3 
Fig. 2. The time manifolds T, and T- and the spatial manifold S. ABCD ıs the cross 
section of the characteristic cone. 


We may restate this by saying that the characteristic cone divides the 
manifold R, (x) into manifolds T and S with reference to signals of the 
type ọ. In the region T (in the upper cone) lie events which may be 
genetically related to event at the point P; in the region T_ (in the lower 
cone) lie events which may influence the character of event P. 

We shall now consider the case when there are two types of fields, g 
and y, which have different velocities of propagation v, and v,. Let 
|v,|<|v,|. The characteristic cones for these two fields are shown in 
Figure 3. We shall investigate the situations which arise when we use 
either the signal ọ or y for dividing space and time. To take a specific 
case, we define v, as the velocity of sound and vy as the velocity of light 
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à 9 9. Y 
Fig. 3. Two characteristic cones for the fields g and y. 


(\vg|<|v,|). We further assume that, within the population, there is a 
blind group that does not accept signals y; their shortcomings will be 
the basis for their selection of the sound field as the field with which they 
determine the relationship between phenomena. And so these people 
divide the manifold ®,(P) into T, and S,, but have no idea of the 
division of ®,(x) into T, and ©,. How will the interrelationship of 
events in the manifold ,(P) appear to them? 

We shall examine this relationship using Figure 4. Let A and B be two 
spatially separated points. The distance between them could be measured 
with a sound signal v,, or with a rigid scale. Let the event at A be a spark, 
which produces both light and sound. At the point A’, an observer re- 
ceives sound reflected from point B (an echo); but the light from the 
spark, arriving at point B”, can produce noise which will reach the 
observer at A at the point A”, preceding in time the point A’. The blind 
observer will come to the conclusion that the sound phenomenon A” 
violates the law of cause and effect. The signal A” arrived earlier than it 
could be expected if its cause were the reflection of sound from point B. 

However, this same observer could discover the constancy with which 
the phenomenon A” precedes the phenomenon A’, and come to the cor- 
rect conclusion that, in addition to the sound field g, there exists yet an- 
other field y hidden from him that propagates with a velocity v, which 
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is greater than the speed of sound v,. In other words, the velocity v, 
which he had assumed to be limiting is in fact not so, because there 
is another velocity v, >v,. And so the blind observer would be faced with 
a dilemma; either to allow the existence of events which disturb causality 
(with reference to the dividing of R,(P)=2,+©,), or to review the 
method by which he divided R,(P) into T and GS. We see that for the 
ordering of events with observance of causality, it is necessary to use the 
signal having the greatest velocity of all the possibilities. 





x 


A B 


Fig. 4. The dark straight lines represent propagation of sound, the wavy line represents 
the propagation of light. The sound signal (B”— A”) appears to violate “acoustical” 
causality. 


To what extent the division in space and time is closely related to 
the physics of the phenomena, it is possible to see from this case that if a 
constancy in precedence of the event A” by the event A’ were not dis- 
covered, the observer would, to the regret of science, consider the exis- 
tence of phenomena that do not obey causality. 

Such a situation is not difficult to imagine if one allows that the state 
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of the medium (“ether”) through which the signal y propagates, would 
be not less capricious than, say, that of the Earth’s atmosphere, and this 
state would influence the velocity of the signal v,. The time 4A’ would 
then be subject to chance variations which would at least make the inter- 
pretation of the “unforeseen” phenomenon at 4 more difficult. 

In concluding this section we shall consider one curious example, 
which will serve as an additional illustration of the principle of dividing 
manifolds R(P) into time T and space S manifolds [11]. Let the field @ 
obey the scalar Klein’s equation. We first consider the case of two mea- 
surements x, and x,. In this case 


Co &o 

—- = ;+mo=0. 3.12 

6x2 ôx? vee (3ta) 
Let x,=¢ be time and x, =x be the spatial coordinate’: in Equation 


(3.12) we had two signs for m? (m being proportional to the mass of par- 
ticles in the field ọ). Assuming ọ ~e+#%* 7, we obtain 


w=+t,/k +m. (3.13) 


From this it follows that, for particles with real masses m? > 0, the group 
velocity of waves — and, therefore, the signal velocity — will be equal to 


do k 
=—=—<1l. 3.14 
a ae (3.14) 
Conversely, for the case m? <0, 
k 
u=—>1. (3.15) 
@ 


Therefore, in the first case the signal would propagate inside the cone 
T (see Figure 5) and if events were ordered with the help of the fastest 
signal u=1, then, because u < 1, normal ordering of cause and effect will 
be fulfilled by all signals. If, however, m? <0, then the signals that prop- 
agate with a velocity u> 1, will violate causality (they will propagate with- 
in the space cone S). 

It is easy to see, however, that the interpretation we took for the vari- 
ables x, and x, in the manifold R,(x), is inaccurate [11] for the case 


t In Equation (3.12) the speed of light c is set equal to 1. 
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Fig. 5. In the case of m?>0 x, =1, x, =x; for m? <0 x, =x, x, =t, i.e., the spatial and 
temporal regions change places. 


m? <0. In fact, by definition of the time region T, this is a region inside of 
which signals propagate. Therefore, in the case where m? <0, the correct 
interpretation requires that x,= x, in the region ©, and x, =ż, in the 
region T. This is immediately evident from Equation (3.12) which trans- 
forms into itself when m? > —m?, x42 X14. 

In the four-dimensional case R, (x), the situation is more complicated. 
In this case, the equation takes on the form 


po op op Co 
ôx? ôx? ôx? Ox? 
By writing the equation in this form we assume that we purposely con- 
sidered x, as time ¢, and x4, x2, X3 as the three spatial coordinates. We 


now look for the Green’s function for this equation. For this, we turn to 
the inhomogeneous equation 


+m’o =0, (3.12) 





eG &G &G 8G doa ani 5 5 5 
ax? ôx? Ox2 = ax? + m°G = 6(x4) ô (x3) 6 (x2) ô (xy). 
(3.16) 
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G(x) = fe (q) e'** d*q, (3.17) 


where gx=)'#.19;X;=)8=19%4X4—G4X4, and substituting this into (3.16) 
we get 


3 1 
G(q) == E> (3.18) 
Org + asm al 
and, therefore, 
eax 1 
G(x) = e (3.19) 
[q + — qa) 


where q=(q,, q2, q3). In the case m?>0 we may consider q, as the fre- 
quency w= +(q?+m7)!/?. The group velocity of all waves emanating 
from the point x =0, is u=0@w/0k <1 and we can interpret the coordinate 
x, as time t, and the coordinates x4, x2, x3 as the three spatial coordinates. 
For m? <0 [minus sign in Equation (3.19)], signals originating from the 
point x=0 with a velocity u=ém/ék > 1, will go inside the cone D, as in 


X2 





Fig. 6. For m?>0, a signal from the vertex of the characteristic cone is concentrated 


inside the region T which is therefore considered as temporal. For m? <0, the signal pro- 
pagates in the region © and in this case x, =¢, x2=12, X4=X. 
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the two-dimensional case, and if we also insist in this case on considering 
X4 as time, we come to a contradiction of the concept that the zone of 
influence in the case of m° <0 is the region inside the light cone (see Figure 
6). We must therefore admit that we are concerned with the manifold in 
which there are three times, £ =x,, ta =X , t3=X3, and the one spatial 
dimension x= x4. 


4. AFFINE MANIFOLD 


As was discussed above, various methods of arithmetization of a manifold 
of events are possible in principle. Some of them will naturally reveal the 
deep characteristics of the space-time relationship, others will behave as 
a warped mirror in which reality is distorted. This caricature can go so 
far that reality becomes almost unrecognizable or, philosophically speak- 
ing, almost unknowable. 

The most general characteristics of geometric manifolds which have 
to be reflected in the system of arithmetization that is chosen, can be 
listed as follows: (a) continuity, and (b) characteristic topological con- 
nectedness‘. We shall interpret (a) by assuming that around each point 
P(x) in the manifold R,(P(x)), there is a region formed by neighboring 
points. 

If this manifold is deformed without forming discontinuities, then the 
neighboring points remain neighbors and the structure of the manifold 
will be characterized exclusively by features of the topological connected- 
ness [point (b)], i.e. by the ensemble of deformations that do not disturb 
the continuity of the manifolds. In Figure 7, three-dimensional surfaces 
are shown, which represent members of a manifold. The surfaces of the 





Fig. 7. Four topologically equivalent surfaces. 


t The definition (a) excludes from consideration manifolds that consist of a great number of 
points, e.g , a spatial lattice. 
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bodies a and b, and c and d, taken in pairs, are topologically equivalent 
but the surfaces of a and b are not equivalent to those of c and d, because 
the former surface cannot be changed into the second type by a con- 
tinuous deformation. 

We shall carry out an arithmetization of the manifold such that the 
neighboring points in the manifold will correspond to the neighboring 
numbers. In view of the continuity of the manifold, these numbers will 
also be continuous. 

In accordance with the dimension n of the manifold R,( P(x)) = &,,(x), 
the number of numbers that characterize any point in the manifold is 
equal to n. Earlier, we called these n numbers the coordinates (x) = 
(Xi, X2,..., Xa) Of the point P. It is possible to select these coordinates in 
various ways, and, from the mathematical point of view, it is sufficient if 
the various possible coordinate systems have a one-to-one relationship 
between them, if not throughout the manifold R,(x), then at least in its 
separate parts that cover the entire manifold ¥,(x). If such parts are 
R (x), Ri (x), R(x), ..., then 


R(x) Ri (x) ORY RP, 


The general coordinate transformation in any one of these regions 
[which may cover the whole manifold ,,(x)], can be written in the form 
(12, 14] 








x" = f"(x), (4.1) 
where f"(x) is a differentiable function and p=1, 2,..., n. We write 
ôx” Ofe 
= = 4.2 
ae ae vo 
so that 
dx’? = af dx’. (4.3) 


We consider Equation (4.3) as solvable for dx’, so that the determinant 
loch] AO. (4.4) 


In the manifold R(x), we may examine some functions of the point 
P: the scalar field ọ( P), the vector field 4” (P) (u = 1,2, ..., n), the tensor 
field of various ranks T*’---(P)(u, v,- = 1, 2,...,2), which transform in 
the following manner under transformation of coordinates (4.1): 
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gy (P') = o (P) = o (P), (4.5) 
A" (P’) = ay (P) A” (P), (4.5) 
T(P") = ag (P) ap (P) T°? (P), (4.5) 


By P’ we mean the point that is obtained by transforming point P by 
(4.1). Vectors and tensors which transform according to (4.3), (4.5’) and 
(4.5”) are called contravariant. 

We can consider vectors A,(P) and tensors 7,,...(P) that transform 
according to 


A, (P’) = B; (P) A, (P), (4.6) 
Tav.. (P') = Pa (P) BAP) = Tap... (P), (4.6') 
where 
ox’ 
pa = axe (4.6”) 


Such vectors and tensors are called covariant [14]. 

Because vectors in various points of the manifold transform in various 
ways [the coefficients «# (P) and p} (P) are functions of the point P], the 
comparison of two vectors at different points in the manifold A(P) and 
A(P’) cannot be made without further operations. In order to compare 
these two vectors, the vector A(P) must be brought over to the point P’, 





Fig. 8. A parallel translation of a vector along the closed contour PP’P” which 1s made 
up of segments of geodesic lines. 
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keeping it parallel to itself. During such an operation, the components of 
the vector A“( P) undergo a change 6A“ (see Figure 8). We consider such 
a change for two points infinitesimally close to each other, P(x) and P’ 
= P(x+dx). The magnitude of ôA“ has to be a function of the com- 
ponents of the displaced vector (6A"=0! for the nul vector), and also a 
linear function (the sum of the vectors must transform like a vector). 
Futhermore, the magnitude of 6A“ has to be proportional to the displace- 
ment dx”, for dx*=0, 6A*=0. Therefore, we may write ôA” in the form 


5AM = — THA" dxf, (4.7) 
where I’, is a function of the points P. 

The coefficient I, is called the coefficient of affine connectedness, or 
Christoffel’s symbol. The complete change of the vector A(P) in going 
from point P to P’ is equal to 

DA" = dA" — 6A", (4.8) 
and, correspondingly, the derivative of the components of the vector A 
with respect to x, is 
DA" GA" 
Dx, ôx, 
This is called the covariant derivative [13]. 
Let us now consider some line in an n-dimensional manifold 


x" = X#(z), (4.10) 





+T A. (4.9) 


where t is a parameter that defines the line. Amongst the various lines 
that pass through the points P and P’, there is one whose tangent 

dx“ 
~ dt 


remains parallel to itself. On this line the covariant derivative of the 
tangent vector is equal to zero; i.e., 


(4.11) 


ut 


Dut = du" + F',u* dx? = 0 (4.12) 
or 

du” PH tye = r 

de ape = 0. (4.12) 


Such a line is called a geodesic [12-14]. 


GEOMETRICAL MEASUREMENTS IN THE MACROWORLD 23 


By definition, the parallel translation has to conserve the “angle” be- 
tween the tangent to the geodesic line and any other vector, if it is trans- 
lated along the line. Therefore, for any vector A, we have 


DAY =0. (4.13) 


This change of the components of a vector under parallel translation 
is a postulate affine geometry, and is closely related to the concept of 
curvature of an affine manifold. 

Let us consider an infinitely small contour C that consists of geodesic 
lines (see Figure 8). We translate the vector A parallel to itself along the 
contour. Then the size of the component of the vector is defined as 


da= l rA dw. (4.14) 
c 
Applying Stokes’ Theorem, we obtain 
4A" = f [e Cio aes] Af” (4.15 
2| ôx, ax? i Aa 


where Af”? is the component of area drawn along the contour C. The 
partial derivative in (4.15) can be expressed, using (4.13), in terms of the 
components of the vector A. We thus obtain 





4A" = — 3 Rip Af”, (4.16) 
where 
ore, ore y 
Roos a re = ae + iw ase = Tiol ip (4.17) 
is the curvature tensor of the manifold ( Riemann tensor ). We can show that 
this tensor is antisymmetric with respect to the indices a, p, i.e., 








Rapo 0 Risp ’ (4.18) 
and 
Rips + Reps + Roa = 0. (4.19) 


5. THE RIEMANN MANIFOLD 


The affine manifold considered above is still rather general. If affine geo- 
metry is supplemented by the concept of separation between two points, 
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or in other words, if it brings a metric into the manifold, then we arrive at 
metric geometry or Riemannian geometry. Instead of the distance between 
two points P and P’, we will use the more general concept of interval, 
and keep in mind that the manifold contains both space and time. 

We shall denote the interval between two infinitely close points P(x) 
and P(x+dx) by ds. According to Riemann’s basic hypothesis, ds? is the 
squared form of the separation of the coordinates of points P and P’: 


ds? =9,, dx“ dx’, (5.1) 


where g,,,=9,,(P) is the symmetric tensor g,,, =g,,,, known as the metric 
tensor. If we rotate the coordinate system in the neighborhood of P, and 
by proper choice of scale, (5.1) can be rewritten in cannonical form [14] as 


ds? = ¢,,,(dx"), (5.1) 


where ¢,,,= +1. The quantity f=)" =; Epu is called the signature of the 
metric. If the signature is equal to the number of dimensions, f =n, then 
the metric form is 


ds? = F (dx“)? (5.1") 
=i 
and it expresses Pythagoras’ theorem for infinitely small regions in n- 
dimensional space. If f #n, then the interval ds? may be written in the 
form 


ds? = f (dx — $} (dx")?, (5.1) 
u=1 u=m+1 

so that f=2m—n+ 1. In this case the Pythagoras theorem is satisfied in 
all manifolds R,, (x) or R,,_,,(x). Such a space is called pseudo-Euclidean. 
The choice for the quadratic form for ds? has been considered many 
times. In fact, this hypothesis of Riemann shows that “ordinary” geo- 
metry, which is based on the concept of rigid bodies, holds in the neighbor- 
hood of each point P. For this type of geometry, the basis of the quadratic 
form, which defines the concept of separation, was given by Helmholtz 
(compare with Section 2). In this manner, the general case of Riemannian 
geometry reduces either to Euclidean, or to pseudo-Euclidean geometry in 


t We note that here the concept of the infinitely small has a pure mathematical meaning. 
Physically it could be a cubic kilometer in the Galaxy, or a cubic fermi in an atomic nucleus. 
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the neighborhood of each point in the n-dimensional manifold R, (x). 

We recall several important formula of Riemannian geometry. The 
metric form (5.1) enables us to define the scalar product of two vectors 
(AB), taken at the point P: 


(AB) = g,,A"B’ = A“B,, (5.2) 
where 
B, = GuyB" (5.3) 
is the covariant form of the vector B. 
Solving (5.3) in terms of B”, gives 
BY = g™”B,, (5.4) 


where g"”= M*"/g, and M*” is the minor of the determinant 


g = lgu. (5.5) 

From the definition of tensor transformation (see Section 4), it follows 
that the scalar product is invariant under coordinate transformation (4.3). 
The length of the vector 4, or, to be more exact, its square must not 
change under parallel displacement from P to a neighboring point P’: 


Ľ = (AA), (5.6) 

and the angle between any two vectors A and B must not change 
AB 

cos @ = A a (5.7) 


~v (44A) V (BB) 
i.e., the covariant derivative of the scalar product (4B) must be equal 
to zero; and 
D(AB)=0. (5.8) 
This relationship allows us to express the coefficients of affine connect- 
edness T% in the terms of the metric tensor g,,. We obtain 


1 Gea C9 pa OFap 
pe, a1 guo ( Goa | 29p0 _ Uap), 5.9 
AES (5 oxe Ox? GA 


From Equations (5.9) and (4.17) we may also obtain an expression for the 
curvature tensor from the second derivative of the metric tensor. If we 
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introduce the covariant curvature tensor 

Ryapy = InaRp, 5 (5.10) 
then we find that 

R 1 ( 07 Guy 7 Jap 0 up 07 Gap ) 





naby = 9 \ ax2ax® | GxtOx? Ax®Ox! _ x#Ox? 


+ Ipo Cipla — TET ap) (5.11) 
In the general theory of relativity, the symmetric tensor 
Ryy = Riav (5.12) 
plays an important role, as does the scalar curvature 
R=g"R,,. (5.13) 


Thus, in metric Riemannian geometry, all quantities that are important in 
affine geometry can be expressed in terms of the metric tensor and its 
derivatives. 

We will also consider group properties of Riemannian geometry. For 
this purpose, instead of general coordinate transformation we use the 
infinitely small transformation 


xt = xt + EM (x), (5.14) 


where é (x) is an infinitely small vector. It is also possible to consider this 
transformation as an infinitely small displacement of the point P to the 
point P’= P(x’). For such a displacement, the metric tensor undergoes 
the transformation 


Ox? Ox? 


Be = aa gxr Soe 


ag? (P) ô’ (P) 


= Guv(P) — Jup (P) ax” — Gov (P) öx" . 








(5.15) 


On the other hand, the original metric at the point P’ may be ex- 
pressed in terms of the metric at the point P by means of the expansion 


guy 
OX, 


Juy (P) = Guy (P) + a= E (P) +- (5.16) 
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From this, the change in the tensor g,,, caused by the displacement P— P’ 
is equal to 


age OEP OG uy 
Ade = — Due Bx — Doe Gye Bye E H Gy) 





The metric does not change under this displacement if Ag,,=0, i.e., if 


Caer o ii 


Juo ga T Io Bye Axe 
These are equations of the group of infinitely small displacements. They 
are called the Killing equations [12.14]. If it is possible to find an in- 
finitely small vector č” (P) for the manifold R,(P) with the metric (5.1) 
that satisfy Killing’s equations, then transformations characterized by 
the vector &*(P) are allowed in the manifold. In particular, in the case 
of Euclidean geometry g,,=6,, and Killing’s equations give 


e = 0. (5.18) 








Gee oč” 

ax” axe” OY 
from which 

oH = abx” + a", (5.20) 


where af = — a), and &” are infinitely small constants. Thus the group of 
motions in Euclidean space consists of rotation Ô (a) and translation Î (a). 
From the equations given above, we can see that the metric equation (5.1), 
which defines geometry in the small, fully characterizes the Riemannian 
manifold. It is as if an elementary cell of which the structure of geometry 
as a whole is composed. 

We have already discussed the basic assumptions that dictate the choice 
of the quadratic form (5.1) for the squared interval. In his prominent 
thesis [15], Riemann first showed the connection between the 
mathematical and physical postulates, which acquired fundamental 
meaning in Einstein’s general theory of relativity. Modern physics, as we 
shall see later, stands before the necessity of critically examining the 
geometrical representation that is applied to the microworld. Therefore, 
in the conclusion of this section, it is appropriate to recall the ingenious 
words of Riemann, concerning geometry in the small: 
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The question of whether the assumption of geometry in the infinitely small ıs valid is closely 
related to the question of the internal cause of the origin of metric relationships in space. Of 
course this question also refers to the studies of space, and in these studies we must re- 
member the remarks made above about the fact that, in the case of a discrete manifold, the 
principle of metric relationships is already contained in the very concept of this manifold 
whereas, in the case of a continuous manifold, we must search for ıt in some other place. 
From this we see that the reality which establishes the idea of space forms a discrete mani- 
fold, or that we must try to explain the origin of the metric relationships by something 
external, by binding forces which act upon the reality. 

We can hope to find the solution of these problems only when, starting from the con- 
temporary conception which has been proved experimentally and is based on the 
foundations laid by Newton, we begin refining it successively being guided by facts which 
cannot be explained by it. The investigations like those performed in this work, namely 
which start from general concepts, serve only as a tool that does not allow the limitedness 
and long standing prejudices to hinder progress in the understanding of things. 

We stand here on the threshold of a region that belongs to another science, to physics, 
and now we have no indication how to cross it. [15]* 


6. THE PHYSICS OF ARITHMETIZATION OF THE 
SPACE-TIME MANIFOLD 


The assignment of specific coordinates to physical events is not only a 
mathematical process. The issuing of a passport in itself implies definite 
manupulations with material objects which are necessary for defining 
the location of an event in space and time. 

The classical method of arithmetization was that of using a uniformly 
running clock (“rigid” time) and a rigid scale. In this measuring system, 
however, the problem of agreement between the clock and the scale, 
which move with respect to each other, required the use of signals whose 
laws of propagation had the greatest generality and simplicity. This 
problem became particularly poignant when applied to bodies moving 
with high velocities. (Historically, these bodies were electrons whose ve- 
locities approached the speed of light.) The signals which comply most 
naturally with the conditions established above are light signals. The 
velocity of such signals proves to be a limiting velocity for all material 
bodies and fields. In particular, the velocity of a signal propagating in a 
solid is significantly smaller than the speed of light c. Therefore, we do 
not have the right, within the limits of accuracy required by the theory of 


t Weyl, in his article dedicated to F. Klein, notes that possibly in the foundations of 
geometry lie discrete manifolds ‘*... now, less than ever before, is it possible to affirm that 
in the foundations of geometry there do not lie any discrete manifolds” [16]. 
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relativity, to consider a solid rod as absolutely rigid. It is necessary to 
keep in mind its elasticity and ability to get deformed. We may formally 
state that the theory of relativity excludes the possibility of the exitence 
of absolutely rigid rods. Therefore, according to the concept stated by 
Weyl [17] and developed by Martske [18], we shall not use rigid scales, 
but rather pulses of light (scintillations) for measuring both distances 
and time intervals. 

A light clock may be built from two mirrors that face each other and 
do not absorb light. If a pulse of light is trapped between two such 
mirrors, it will be reflected alternately from one mirror to the other (see 
Figure 9). 

The number of reflections N may be taken for the measure of time. 
Such “pulses of light” are of course optional. We could have taken any 


A B 
7 X 
L 
J 
A B 
l 





Fig.9 A light clock. 4 and B are mirrors, L ıs a light pulse. 
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other periodic process, for example the reflection of sound, or the uniform 
rotation of a gyroscope. We note that in any case we do not have an a 
priori basis for being convinced of the strict uniformity of motion of any 
selected clock. The irregularity of motion of a clock could be discovered 
only in the case that an irregularity were observed in the motion of all 
phenomena. Therefore, it would be natural to suspect the clock of irreg- 
ularity as the cause for any observed changes in the rate of change of all 
phenomena. By selecting other clocks, which are closer to the ideal — in 
other words, less dependent of the external situation — we would eventual- 
ly become convinced that the previous clock “misbehaved”. 

The light clock with mirrors has the advantage, which has been 
sufficiently well substantiated experimentally and is in agreement with 
the principles of the theory of relativity, that the speed of light c does not 
depend on the motion of the source of light (in the above case, the mir- 
rors) nor on the motion of the observer. Thus the uniformity of motion 
of the light clock is based on very general principles, and the nature of the 
mirrors and the possibility of changes in their composition is immaterial 
as long as the separation of the mirrors does not change and their absorp- 
tion of light remains negligible. 

The absence of dispersion of light waves in vacuum is an important 
condition for the use of light pulses in light clocks and as signals for 
determining the metric to be possible. Let the light signal at the time 
t=0 be given in space in the form @(x, 0). We will write it in the form 
of superposition of plane waves 


ọ (x, 0) = | C (k) e** dk. (6.1) 
k>O 


This signal at time ¢ will have the form 
(x, t) = [ew etx- dk = ọ (|x| — ct, 0) (6.1) 


where w is the frequency, œ= |k], and c is the speed of light. This shows 
that the signal has its own form and does not spread out in space. In other 
words, the dots and dashes of the Morse code do not run into each other in 
proportion to the propagation of the signal.t 


t This property of light pulses holds even ın the three-dimensional case (see Section 17). 
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We shall now turn to a conceptual experiment for the definition of the 
interval between phenomena. In Figure 10a, the ordinate is time and the 
abscissa, the spatial coordinate x. The line AA’ is the world line of the ob- 
server A. He has two mirrors at his disposal that form a light clock, the 
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Fig. 10. A method of measuring the interval 54, between events A and B using the reflec- 

tion of a light pulse. (a) A tıme interval; (b) a spatial interval AA’ ıs the world line of the 

observer A; CC ‘1s the line of the second mirror of his clock, and BB’ ıs the world line of 
event B. 
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first having the world line AA’ and the second, a similar world line 
CC’. The zig-zag line between 4A' and CC’ is the world line of the light 
pulse that is alternately reflected from mirrors A and C. Each break in this 
line can be used as a measure of time. Let the world line of another body 
B be BB’. In order to define the intervals s,, between the events A and B, 
we send a light pulse from A to B, where we place a mirror which sends 
the signal back to A. It will be sent at the time 7; =t— AB/c by the light 
clock at A, and it will return to A at the time t, = t + AB/c. Therefore, the 
interval s4, between the events A and B will be defined as 


2 3 AB? 
Sap = tita = t -—5=- > 0. (6.2) 
c 


Here t, = N,t and t, = N31, where N, and N, are the number of reflections, 
measured by the light clock, and t is the period of the clocks. In this case 
the interval s4, is greater than zero. Such an interval is called time interval. 

In Figure 10b we show an example in which the light signal from 
A cannot get to the event B. It is not difficult to see that in this case it 
follows that ż4 <0 [see the point A, (0, ¢,) in Figure 10b], and so the 
interval can be written in the previous form 


AB? 
San = tit =? — 3 <0, (6.3) 


but now it is less than zero. In this case the interval is called the spatial 
interval. 

If the speed of light is bounded, then the process of arithmetization we 
considered above produces a division of the manifold of events R,, (x) into 
space S(x) and time T(x) with the help of the light cone 


=t?7-—=0, (6.4) 


where 
r= x? + y? +27. 


Moreover, the definition of the form of the causal relationship of events 
is brought out. Events at the world point P can only influence events 
which are located inside the upper cone T, (P) (see Figure 6) which is 
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defined by 


pit Oi (6.5) 
C 


All events that are located inside the cone of the absolute past T_ (P), 


; 
t+—-=0, (6,6) 
c 


can influence events at P. Events which are located in the spatial region S 


r? 


rr 5 <0, (6.7) 
c 
cannot be related causally (genetically) to events at P. 

In connection with the fact that all geometrical relationships are 
defined with a clock and light signal, the geometry of a four-dimensional 
manifold with signature f= —2, is sometimes called chronogeometry 
(see [18]). 


7. ARITHMETIZATION OF EVENTS IN THE CASE OF THE 
NON-LINEAR THEORY OF FIELDS 


This section should serve as an illustration of the principle of dividing 
the manifold of events R(x) into space G(x) and time T(x), which was 
presented in Section 3. In the case of non-linear theory of fields, the 
velocity of the signal depends on the magnitude of the field and its 
derivatives. Therefore, the division itself of R(x) into space and time 
manifolds depends to a great extent on the dynamics of the motion of the 
signal. In particular the characteristic cone (see Figure 2) in this case is 
distorted, and for some values of the field and its derivatives, it can 
become imaginary. So as not to make the discussion more difficult, we 
shall consider the field to be scalar, and restrict it to two dimensions, 
Xx xg ah 

As a basis for such a theory, we shall take the variational principle 
with the Lagrange function that depends on the invariants formed from 
the field g(x, t) and its first derivatives p=0@/0t, q=0o/0x.' Such in- 


t This theory presents the scalar version of non-linear electrodynamics of Born and 
Infeld. which they developed in the 1930s [19, 20]. In the discussion of the scalar theory. 
we follow article [21]. 
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variants are 


E (7.1) 
K =}(p’ — 9’). (7.1') 


The Lagrange function # is equal to 
£=L(K,), (7.2) 


and the principle of variations states that 
ôL = foz dx dt =0. (7.3) 


It is obvious that the theory in this form guarantees invariance with 
respect to the Lorentz transformations. Performing the variation, we 
note that 


PA ae rae ô (74) 
Fae POR WON ts, OO 


as usual, allows the surface terms of the integral (7.3) to vanish. 
Equating the coefficients of dg to zero and using (7.3) we obtain 











a (aL \ 2 2% 04 re 
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or 
ay ay ay 
2B R=0, 1.6 
ae oe ae ere 
where 
A=(1+ap’), B=apqg, C=(1—aq7), R=y—2BK 
(7.7) 
and 
PL lav ee lav a aL 
a= — + = $ Yy sz p (7.8) 
ôK?| ôK ôK ôI| ôK ôI | ôK 


In the case when ¥ is linearly dependent on the invariants K, I, when 
a= ß=0 and y=constant, we get back to the linear Klein’s equations for 
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a scalar field g. The characteristic curves of Equation (7.6) are determined 
from the equation (see, e.g., [22] and [23]) 


AE — 2BE4+C=0, (7.7) 

where 
dx 
~ dt 


is the velocity of propagation of the wave front. We call this a weak 
discontinuity (as we did earlier in Section 3). From (7.7) we obtain 


+./1+ 20K — apq (79) 


ae 1+ ap? 


c (7.8) 


or, for small a (K, J) 
ESETE A pay i (740) 


If we now suppose that, in the region a<x<b for t=0, there is some 
initial state @, p, g=C@/0dx, then, from (7.9) and (7.10) we see that it will 
propagate with a characteristic velocity č, that differs from the speed of 
light c which is set equal to +1 in this section. Two possible cases of 
field propagation are shown in Figures | la and 1 1b. In (a), the velocity of 
propagation |¢|<1 and in (b), |€|>1. In the latter case the non-linear 
signal violates relativistic causality and arithmetization of the manifold 
of events has to be reexamined. 





Fig. 11. Properties of a nonlinear field. (a) |č|= |dx/dz|< 1; (b) |€|=|dx/dz|>1. 
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It is an interesting fact that in non-linear theory it is impossible to 
rule out the situation in which, for fixed values of ọ, p and q, the char- 
acteristic curves become imaginary, so that equation (7.6) becomes 
elliptical. In this case the concept of the cause of successive events loses 
its meaning and we shall have to deal with a “lump” of events which 
mutually cause each other but do not follow one from the other [21]. If we 
assume that such situations take place somewhere inside particles in the 
region of strong fields, then further away from the particles, the equa- 
tions should become hyperbolic. On the surface where this transition 
takes place, the discriminant 


B? — AC =1+4 20K (7.11) 


vanishes. The change over from elliptical to hyperbolic equations is 
accompanied by a discontinuity in the first derivative of the field ọ (p or 
q or both) (see [10, 23]). 

In conclusion we note that the theory of non-linear electromagnetic 
fields is closely related to the theory of scalar fields. In this case the 
invariants K and J have the form 


K= Le — #7), 1=(ER), (7.12) 
l 


where & and # have the same sign as the electromagnetic field strength. 
Varying the electromagnetic potential A and @ so that 


OA 
Gao Mdg, HA = rota, (7.13) 
t 


and using the variational principle (7.3), we obtain the equation for the 
electromagnetic field 


P = rotB, divD =0, (7.14) 
where D and B are the electromagnetic inductions 

D=Mé&-NH, BMX +N, (7.15) 
and the quantities M and N are equal to 

T 0g OL (7.15") 


—, N=—. 
0K’ ôl 
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The second group of equations 


OH 
a te, div W =0 (7.16) 


are obtained directly from (7.13). For the one-dimensional case, &(é,, 
0, 0), WO, H,, 0) and &,=€(t, z), H, =Æ (t, z) we obtain from (7.14) 
and (7.16) 


E Oa 60 ae 
pee OB CA 
ae ARAE a RLE Tag 
0H 
+ (1 — 0?) — =0, (7.17) 
Oz 
06 ÔE 4 
az ott 


where, as before, « = 0° L ôk? /0 Z /ôk. The equation for the characteristic 
direction €=0z/dt will now have the form 


(1 + a8?) E + 20 WE + (1 — oH?) = 0, (7.18) 


from which we obtain 


poe Taek ee (7.19) 


1 + ag? 





or, for small « 
E=t1F4a(O+ H+. (7.20) 


From the above, we can see that the non-linear field theory allows the 
existence of signal velocities which can be either smaller or greater than 
the speed of light c, or even be imaginary. Thus the theory of relativity, 
which assumes a constant speed of light in a vacuum, implicitly assumes a 
linear theory [24]. The subject of signal propagation in Born’s non-linear 
theory is considered in detail in [25]. We shall return to Born’s theory in 
Section 43 in connection with quantization of space-time. 


8. THE GENERAL THEORY OF RELATIVITY AND 
ARITHMETIZATION OF SPACE-TIME 


In the general theory of relativity the metric of the Riemannian manifold 
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of events becomes dependent on the motion of matter. The coefficient 
Guy(X) in the metric form that defines the interval between close events, 


ds? = g,, (x) dx" dx’, (8.1) 


in the general theory of relativity is a functional of the energy-momentum 
tensor for matter T(x) 


Guv (X) = Gyy {Tuy 2} - (8.2) 


This functional relationship is defined in terms of the well-known Ein- 
stein equation [13, 14, 26]. 
x 
Ryy az $ IuwR = ce Tav (x), (8.3) 
where xk =87y/¢? =1.86 x 10727 cm gm“? is Einstein’s constant,’ R,, is 
the curvature (see Section 5). 

Two additions have to be made to Equation (8.3): the first, an equation 
of motion of matter (e.g., the equation of relativistic hydrodynamics if 
the matter can be represented in the form of a fluid, or the electromagnetic 
field equation if the matter is a field, etc.). The second addition is a bound- 
ary and initial condition. 

The general theory of relativity allows the same general coordinate 
transformation as the Riemannian geometry (see Section 5), namely, 


x" = f'(x), (8.4) 


where the function /*(x) maps R,(x) onto ®,(x’) in a continuous and 
unambiguous way. The great arbitrariness contained in this transforma- 
tion must be used very carefully. The formal equally of various coordinate 
systems may, in fact, reduce to almost zero by the particular features 
inherent in the problem itself. 

The mathematically arbitrary choice of coordinate system has to 
be checked against other methods of implementing the selected co- 
ordinate systems by physical methods. In other words, we must show 
practical methods of measuring intervals in a manifold of events. 

A basic property of Riemannian geometry, which is pseudo-euclidean 
in the infinitely small, suggests that the method of defining intervals 


t Where »=6.67 x 1078 cm? gm~! s~? ıs Newton’s gravitational constant. 
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between events A and B that were discussed earlier can be carried over 
into the general theory of relativity only if the events 4 and Bare infinitely 
close to each other. Let the square of the interval, in a selected coordinate 
system, have the form 


ds? = J44 (dx*)? + 2945 dx’ dx” A Irs dx" dx*, (8.5) 
where s, r=1, 2, 3. For a light signal ds? =0, so that 
Ide J= dx? Gye 


944 J44 





dxt = dx" dx’, (8.6) 


J44 


where the two signs + corresponds to the direct signal from A to B and 
the return signal from B to A. 

Let us now return to Figure 10 (p. 31) and consider the events A 
and B as infinitely close. The coordinates of the event A are x,=x4,=x} 
=x4=0 and the coordinates of event B are dx§, dx}, dx}, dx}. We 
obtain the characteristic time of A from (8.5), assuming dx*=0: 


ds = dt = ./g44(dx*). (8.7) 
From Figure 10 we have 
dt; = \/gaa(dxg—dxt), dtz = \/daa (dxg — dx3), (8.8) 


where dx? is the change in dx* corresponding to the arrival of the signal 
from A, to B, and dx is that for the same signal from B to A. These 
quantities are defined from (8.6). 

By multiplying dt, and dt, we obtain 


ds4p = dt, dt, = g44 (dX$)? + grs (8.9) 


This is a generalization of Equation (8.5), for the time interval 5 4,. 
Similarly, for the spatial interval, we obtain 


— dsłg = — dt, dt, = — gag (dx})? — g,, dx" dx’. (8.9’) 


By assuming that dxg=0, we obtain an expression for the interval 
between 4 and B 


di? = dt, dt, = — g,, dx, dx’, (8.10) 


dl = ./— g,, dx, dx’. (8.10’) 


or 
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The interval between the two points A and B separated by a finite interval 


B 
w= fy — grs Ax" dx’, (8.11) 
A 


depends on the integration path. We can define the “true” interval as the 
interval along the geodesic line 


dx” dx* 
= 8.12 
ISB [ee Irs do do do ’ ( ) 


The integral (8.12) is taken under the condition that the coordinates of the 
curve x*(a) satisfy the equation 


d?xs dx’ dx* 


Oe ON 8.12’ 
do? | ag do ore) 


According to the basic Einstein Equation (8.3), in the presence of 
matter the curvature of space differs from zero, and in the general case, 
because of the motion of matter, it depends on time. Therefore, the 
problem of arithmetization of manifolds of events in the general theory of 
relativity becomes very complicated.* 

The most natural method of arithmetization of events seems to be a 
sequential arithmetization from one point to its infinitely close neighbor. 
By using a light signal, we can define the separation of point 4 and its 
neighbor Band synchronize the clocks at A and B. We shall, by definition, 
consider as simultaneous the moments B(x, t) and A,(0, t) which cor- 
respond to the average of the signal leaving point A, and the reflected 
signal returning to point A, (see Figure 10): 


Jaq Ax* = (dt + V 944 dxf + dt, + Visa dx3) 
=4(dt, + dtp) — gas dx°/s/Gaa- (8.13) 


If event A, is placed at this point, then dt, = —dt., and the difference 
in time x* between two “simultaneous” events A and B will be given by 


dx$ = — gas dx°/g44 (8.13) 


t For details, see references [14] and [18] 
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This is the value by which the reading of the clock B should be changed. 
We note that the measurement was made using the metric at point A. 
We could also have made the measurement using the metric at point B. 

In view of the fact that the intrinsic time at points A and B flows 
differently, the results of the interval measurements will also be different. 
For small separations of A and B, this difference will only show up in 
second-order terms. In fact, the difference in the interval, as measured 
at A and B, is 


Asn = [Guy (B) — Gyy(A)] dx” dx” 





ô 3? 
| dx* + aaa dx* dx? t dx” dx”, (8.14) 

By a suitable choice of coordinate system, the quantity in the square 
brackets, which contains the first-order differentials, can be made equal 
to zero. And thus the intrinsic difference in the interval measurements 
made at points A and B can be made to be quantities of the fourth order. 
This remaining difference can be related to the curvature of space, and so, 
in general cannot be removed by choice of coordinate system. 

We now return to the question of arithmetization of a manifold of 
events. We saw that, at least in the neighborhood of a point, the arith- 
metization that is characteristic of the special theory of relativity is 
preserved in the general theory of relativity: The whole matter reduces to 
measuring intervals in time and to the sending of light signals. 

Thus a natural expression for the interval is pseudo-euclidean 


ds? = c? dt? — (dx? + dx} + dx), (8.15) 


where c dt =dx4, dt is the proper time and dx,, dx2, dx, are the directions 
along the axes 1, 2, 3, measured with a light signal. In the coordinate 
system in which the interval has the form (8.15), the value of the coordi- 
nates coincides with the numerical value of the intervals in time and space. 

In principle, therefore, we have two possibilities. We can assign more 
or less arbitrary numbers to events (satisfying only topological con- 
tinuity) and then calibrate the system using a clock and light signals, i.e., 
relate these numbers to values of intervals between infinitely close events. 
The other method corresponds to the above. Time and space intervals 
between events are obtained, and then these measurements are “forced 
into” a suitable coordinate system. 
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9. CHRONOGEOMETRY 


Although the arithmetization of events is a priori arbitrary, it requires 
the application of definite physical methods which themselves have to 
be able to encompass a widened sphere of applicability, in other words, 
to encompass both the generality of the method as well as the increased 
accuracy. 

For a long time physicists used an astronomical clock, the rotation 
of the Earth, for measuring time, and the platinum-iridium rod, the 
“standard meter” as a measure of length. 

In recent years, standard time and length were borrowed from atomic 
physics, the most accurate clocks being atomic or molecular, and the 
wavelength 4 of the red-orange line of the inert gas krypton (fKr) is 
accepted as the standard length. The length of the standard meter is equal 
to 1,650,763.731. Atomic and molecular clocks owe their origin to the 
development of technology of atomic and molecular beams and to the 
practical application of induced radiation [18]. A beam of atomic cesium 
gives a very narrow line and this guarantees the stability of the frequency, 
(the frequency being approximately 9000 MHz) to 107+? for several 
months. The hydrogen atom also possesses such stability (its wavelength 
is about 21 cm, frequency 1420 MHz). Details about the design of atomic 
or molecular clocks may be found in reference [27]. The speed of light in a 
vacuum may be used instead of a scale for measuring length. The most 
accurate modern data [27] cite the value as 


c = 299,792.5+0.4 kms"! 


The use of clocks as standards for arithmetization of the space-time 
manifold must be considered as the most perfect and universal method of 
arithmetization that the state of modern physics offers. Thus geometry, 
which lies in the foundations of modern physics, is called chronogeometry. 
This name stresses the fundamental role played by the light signal, and 
time. There is no reason for thinking that geometrical measurements in 
the macroscopic world will cease to interest physicists and mathemat- 
icians. The problem of arithmetization in a non-stationary world 
obviously still awaits its resolution. An even deeper problem arises in the 
case when the variations have a stochastic character, so that the metric 
tensor g„, becomes a random quantity. 
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We shall not, however, go into these problems of the large world. 
On the contrary, we shall return to the main theme of this monograph, to 
the geometry of the microworld. We shall proceed from the assumption 
that at least in a stationary macroworld, or in the bounded regions of a 
non-stationary macroworld, the principles of arithmetization in physical 
events are sufficiently clear, and it is understood what the symbols x, y, 
z, t mean for macroscopic events. Chronogeometry underlies the under- 
standing of their physical meaning. 

The principle of chronogeometry will be used throughout this mono- 
graph in explicit form, or in the form of an implicit assumption, as a 
basis for considering geometrical measurement in the microworld. 

Next, we shall proceed to the problems of the microworld, and see the 
gradual growth in degree of abstraction in the use of the concepts of the 
space-time coordinates x, y, z, t in the world of elementary particles. We 
must prepare ourselves for the fact that this growing abstractness is close 
to denying the physical meaning of these variables that we are accus- 
tomed to considering as space-time coordinates. 


CHAPTER II 


GEOMETRICAL MEASUREMENTS IN 
THE MICROWORLD 


10. SOME REMARKS ON MEASUREMENTS IN THE MICROWORLD 


Standard courses in quantum mechanics like to stress that the measuring 
instrument affects the microobject being measured. However, a more 
important fact in understanding the actual situation in the microworld 
is that the microobject, with absolute certainty, will affect the macro- 
scopic instrument. 

It is important to note that the measuring instrument has to be an 
unstable macroscopic system, because the microphenomenon could only 
disturb such a macroscopic system from its state of unstable equilibrium 
and thereby produce a macroscopic phenomenon. This macroscopic 
phenomenon can then be used to measure the state of the microsystem. 

We can represent the situation in quantum mechanics schematically 
with the aid of the symbolic equation 


M=M+(A+D), (10.1) 


where W denotes the whole macroscopic situation in which the micro- 
system p “lives”, and M denotes those parts of the macrosituation which 
determines the initial state of the microsystem u. The recurrence of micro- 
systems u that are embedded in one and the same macroscopic situation 
M, forms the quantum ensemble St: 


R=M+yM+uM+y... (10.2) 


The part of Mt in parentheses, (4+ D), is formed by the macroscopic 
instrument which consists of an analyzer A and a detector D. 

We assume that the part (A + D) of the whole arrangement does not 
exert a substantial influence on the formation of M [otherwise the instru- 
ment (A + D) could not be separated from M}. This fact is also necessary 
if we are to be able to exchange one instrument (4+ D) for another 
(A+ D} that is capable of measuring the state of the microobject by other 
measurable parameters of this microsystem. The analyzer A sorts the 
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microsystems according to state. The purpose of the detector D is to 
register the presence of the microsystem in some or other state. The 
detector is an unstable macroscopic system capable of changing its state 
under the action of the microsituation. An example to illustrate this 
system is given in Figure 12, where S is the source of particles and C is a 
slit which selects particles having a certain direction. This source and slit 
form the part M of the situation which determine the original ensemble. 

















Fig. 12. Illustration of a quantum mechanical measurement. The source of particles S 
and the slit C form the part M of the macrosituation W which defines the initial set 
Yo, M=S+C. The analyzer is the diffraction grating AA’. It resolves %o into beams with the 
momenta Yp, Ypo Wp,,---. The Geiger counters l4, T2, T3... form the detector D so that 
D=IF,+1,+13+-- Finally, M=M+(A+D). 


For simplicity we assume that it can be expressed in the form of the wave 
function ¥,.' The diffraction grating AA’ resolves the beam into a 
spectrum of various momenta p of the particles of the original beam. 
Mathematically this implies that the original wave function ¥, describes 
the state of the original ensemble of particles that is resolved by the 


t In other words, we consider this ensemble as “pure” (see [28]). 
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grating into the spectrum 
Po =) C,H,» (10.3) 


where y, is the state having a certain momentum p,, and C,, is the am- 
plitude of this state. Finally, the detector D is shown as the collectors of 
microparticles T1, l2, ..., Ta. Such collectors could be Geiger counters. 
When a charged microparticle is incident on one, it causes a cascade of 
electrons which excite a discharge in the gas in the appropriate counter. 
The Geiger counter, being charged, forms an unstable macroscopic 
system which registers the appearance of a microparticle in one of its 
possible states Yp, Yp- 

This example illustrates the fact that all information on phenomena 
in the microworld comes to us through phenomena in the macroworld by 
the evolution of some unstable macroscopic phenomenon.' 


11. THE MEASUREMENT OF COORDINATES OF 
THE MICROPARTICLES 


Non-relativistic quantum mechanics formally allows the exact meas- 
urements of the coordinates of a microparticle x, y, z to be made in any 
manner at a given moment of time t (28, 30]. 

As all physical quantities, the coordinates x, y, z are represented 
by the Hermitian operators £, , 2 in quantum mechanics. These oper- 
ators, together with the corresponding operators Py, py, Pz, which are 
projections of the momentum on the Ox, Oy, Oz axes, are subject to the 
well-known laws of commutationt 


[$ P] = ih, [ip] =ih, [2, P] = ih, (11.1) 
[K 9] = (9, 2] = [2,2] =0, (11.1) 
[Êx By] = Lô,, ĝ-] == [B.. by] =0. (11.1”) 


In contrast to the spatial coordinates, the fourth coordinate, time f, is not 
described by any operator in quantum mechanics (non-relativistic or 
relativistic), and remains a classical quantity (parameter): Therefore, in 


t This point of view of measurements in the microworld was successively carried out in 
monograph [29]. 
t Here, as usual, [A, B]= AB— BA. 
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quantum mechanics the time ¢ is not treated in the same way as the spatial 
coordinates x, y, z. This disparity in time and space is preserved in 
relativistic quantum mechanics. This fact should not be surprising be- 
cause the characteristic “equality” of time ¢ and coordinates x, y, z in the 
theory of relativity does not surpass the property which is expressed by 
the equation 


s? = Pr — (x? + y? + 27). (11.2) 


As soon as we turn to real physical events the role of time becomes 
evident in a perfectly clear manner. In fact, the more important problems 
in physics are formulated thus: From the initial value of dynamic 
variables, determine their value at later times. In other words, given 
the values of the variables at t=0, we need to find values for t>0 (or 
t<0). This problem in mechanics arises from the very essence of the 
laws of dynamics. In the theory of fields the corresponding problem is 
that of Cauchy. By the very nature of the laws of dynamics, we cannot, 
for example, correctly formulate the problem “given z, y and z for x=0, 
find their values for x 40”. This important circumstance is expressed in 
quantum mechanics by the fact that in the space of the variables x, y, z, t 
[the space R,(x)] there are no connections between these variables, 
whereas in the space of conjugate momenta py, Py» Pz, P,=E" [the space 
R,(p)], there are connections arising from dynamics. 

Thus in the non-relativistic theory, in the absence of external fields 


E=—, = 
m P (Px> Pys P2)» (11.3) 
where m is the mass of particles. Correspondingly, in the relativistic 
theory 


E =+ P +m. (11.3) 


By virtue of the relationship (11.3) or (11.3’), the space R,(p) reduces to 
a space of fewer dimensions, R, (p). In the relativistic case (11.3’) the 
space is that of Lobachevski (see Section 31). 

In quantum mechanics, the relationships (11.3) and (11.3’) can also 


+ The fourth component of the momentum p, is the energy E. 
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be considered as corollaries of Schrédinger’s equation 
ih — = AY, (11.4) 


where ¥ is the wave function and A the Hamiltonian operator. This 
operator is a function of the momentum operators px, Py, p,. The form of 
the Hamiltonian A defines the dynamics in quantum mechanics. In 
particular, we note that the relativistic Dirac equation, in which time is 
absolutely symmetric with the spatial coordinates (see Section 14), can 
also be written in the nonsymmetric “Schrödinger” form (11.4). 

For stationary states, the time dependence of the wave function ¥ is 
contained in the factor e~‘#/*, In this case (11.4) gives 


EY =A, (11.5) 


That is, the possible values of energy E (or of the fourth component of 
the momentum P, = E) are the eigenvalues of the Hamiltonian operator 
H. They can be discrete whereas the “conjugate” fourth coordinate t is 
continuous. Because of this, there does not exist a relationship [31, 32] 


[ô] = ih (6, =Ê), (11.6) 
similar to (11.1), (11.1), (11.1%). And, therefore, the uncertainty relation 

Litst h 

AE? At > ze (11.6) 


corresponding to that for x and p, 


—— K 

ax? Ap? > =, (11.1) 
does not exist which results from (11.1). Instead of (11.6’), we have the 
relationship 


as 2 


h 
AEP? > 7, (11.7) 


where t is not the statistical dispersion of time, but the interval of time 
(fully defined) in which the average value of a dynamic variable L (which 
is not an integral of motion) changes by AL, compared to its statistical 
deviation (4Z’)!/? [29, 33]. 
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Let us now turn to measuring the characteristics of the operator of 
the coordinates x. It is sufficient for our purpose here to limit ourselves 
to one dimension x. According to the basic principles of quantum 
mechanics, the observed value x’ of the variable x is the characteristic 
value of the operator £ which is defined by the equation 


Se =X Wy, (11.8) 
where y is the characteristic function of the operator x belonging to the 
characteristic value x’. The explicit form of Y, depends on the repre- 
sentation selected. In the momentum representation, y, is a function of 
p, and the operator £ is equal to 


0 
K=+ih—. (11.9) 


In the coordinate representation, w,. is a function of x and the operator 
X is equal to 


x. (11.10) 


x 


From Equation (11.8) it follows that, in the momentum representation 


ip,x/h 
Wx (Px) SA (11.11) 


a/2ah 
and in the coordinate representation 
Wx (x) = 0(x — x’). (11.12) 


If we select the normalized functions y,» they are related by the unitary 
transformation 


fee) 


Wa (x) = | S(x| Px) Wx (Px) dPx, (11.13) 


= 0 


where the matrix elements S(x | p,) are equal to 


exp(— ip,/h) 


a / 27h 


It follows from the explicit form of the eigenfunctions of the operator 


S(x| Px) = (11.14) 
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X that, in the state with a certain value x =x’, there coexist states with any 
momentum p,, from —œ to +00, having equal importance, equal to 
|S(x|p,)/? =42h. Therefore, when making an exact measurement of the 
coordinates of a microparticle x, an infinitely large energy must be 
involved in this measurement. 

This necessary energy may be borrowed from the energy of the 
particle itself, or from another physical system that takes part in the 
measurement (e.g., from any part of the measuring instrument). Thus for 
measuring coordinates, the interference of the instrument with the micro- 
particle state is both inevitable and essential. When considering this inter- 
ference, it is better to speak, not of the measuring the microparticle co- 
ordinate x, but of localizing [30] the microparticle about the point x= x’ 
as a result of its interaction with the measuring device. 

The state w,. which arises from the localization of the particle quickly 
diffuses, so that the average value of the square of the deviation 
Ax? =(x—x)? will grow more rapidly with the passage of time. 

In order to investigate the diffusion of the state obtained by localiza- 
tion at time f=0 and point x= x’, we shall consider the function y, (x) as 
the limit of the sequence of the function Y» (x, a) as a0: 


exp (— (x — x'/a’) 


Sui 


2 : e 
exp (— Fog + Be Dap, (11.15) 


Yx (x, a) = 





~ nh 
The quantity C(p,)=Grh) exp[—(ap,/4)? —(ip,x'/h)] is the same wave 
function, but written in the p, representation. For a0, we have 
lim y» (x, a) = 6(x — x’). (11.16) 


a>0 
From Schrödinger’s equation 
ôy 


ih — =A Y : 
h (11.17) 


(4 =p2/2m is the Hamiltonian operator for a free particle, and m is the 
mass of that particle); each plane wave W,, (x) =exp(ip,x/h)/(2xh)'? in 
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the superposition of waves (11.15), for t>0, becomes 
_ exp [i(p.x — Et)/h] 


/ 2mh 


where E(p,)=p2/2m is the kinetic energy of the microparticle having 
momentum p, and mass m. Therefore, at the time ¢ the wave function 
(11.15) transform into the function 


Yp, (x; t) (11.18) 


1 ap? ip,(x—x') ipit 
EE ER E Se — Pet) ay 
Yes a => | »( 4 h mh 


a (11.19) 





This integral may be evaluated when 


2 it 3 ed et 
Z = (Ap, + B), v=(4 ies ), e 





4 2mħ 2ħA 
(11.20) 
Evaluating this integral gives 
Z oa 1 — B2 
We (x, a, t)“ = ate eae (11.21) 
From this form, it is easy to obtain the width of the packet 
es 2 
Ax? (t) = a? + Im? t? (11.22) 


and note its growth in time. For a—>0, when Yy (x, a)>ô(x—x'), the 
velocity with which the packet diffuses becomes of the order of 
v~h/ma- œ, which is formally postulated in the non-relativistic theory. 
We have considered the eigenfunctions of the operator 2 which are 
quadratically not integrable. We may now consider a second class of 
functions y, (x, a) which remain quadratically integrable, namely, 


_ exp[=(& — x')?/2a?] 
Irsa 


This type of function tends to (5(x—x’))'? as a0. 


Wx (x, a) (11.23) 
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For such a function 


fws (x, a)? dx = 1 (11.24) 


for any a. We may readily see that, for the state described by the func- 
tion (11.23), the quantity 4x? is equal to 
Ax? = a?/2 (11.25) 


and for a>0, x?=0. As regards to the operator x, it operates on the 
function y, (x, a) thus: 


xW (x, a) = xY (x, a) + A(x’, x, a), (11.26) 


where 





A(x’, x, a) = Hna [e sy | (11.26’) 
fale 


and for a>0 
A(x’, x, a) > $r a(x — x’) ô(x — x’) = 0. (11.27) 


Therefore, for a0, the function (11.23) satisfies Equation (11.3) to any 
degree of accuracy. 


12. THE MECHANICS OF MEASURING COORDINATES 
OF MICROPARTICLES 


Thus far, the discussion concerning measurements of the coordinates of 
a microparticle x has been rather formal. From this formal point of view, 
the coordinate measurements indicate that some initial states described 
by the wave function W(x) “reduce” to the state 


Wo (x) > Wy (x). (12.1) 


However the realization of the process of localization of particles near 
some point x =x’ is not simple, even in a gedanken experiment. 

Thus it is important to divide measurements into two groups, the 
direct and indirect, as was done by Mandelshtam [34]. In the case of the 
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direct measurements, the measurement itself is realized in the first link 
joining the microobject with the macroscopic instrument. In the second 
case the measurements have one or more intermediate links. 

For example, if a beam of free microparticles is incident on a photo- 
graphic plate and it produces a “spot” on the plate, such a phenomenon 
is called a direct measurement of the coordinates of the free particles. 
However, the dimensions of the spot are not smaller than the dimensions 
of the atoms in the plate. They are in fact larger, because the smallest spot 
has the dimension of the grain size of the plate. Thus the initial wave 
function that describes the incident beam W(x) does not tend to yy (x) 
=06(x—.x’) in such a measurement, but to some function y (x, a) which 
has the dimension a which is greater than or equal to the atomic size of 
the photographic plate. However, theory does not limit the size of atoms, 
which may be very small, and so there is no inconsistency within the 
theory. 

As an example of an indirect measurement, we may consider the 
measurement of the coordinates of an electron within an atom. In 
principle such a measurement can be made by observing the scattering of 
a narrow beam of microparticles from a bound atomic electron. The 
scattering of these particles can be registered directly on a screen, and be 
related to direct measurements. The narrow beam of particles which is 
used for the probe forms an intermediate link. Therefore such a measur- 
ement is called indirect. 


A. Direct Measurement of Coordinates of a Microparticle 


We shall now turn to the mathematical side of the problem and consider 
the interaction of the microparticle and the atoms in the photographic 
plate [29]. Insofar as we are interested in the basic points we will make 
several simplifying assumptions which are compatible with the principles 
of the theory. We consider the photographic plate as a system of infinitely 
heavy atoms whose location in space is known exactly. We assume that 
the primary microphenomenon which “initiates” the photochemical 
reaction consists of the excitation of the atom in such a way that its 
electron goes from the state W(x) to the state w,,(x) where x is the co- 
ordinate of the electron in the atom. Ifa is the dimension of the atom, then 
Wo and w, disappear rapidly for |x|>a. We denote the coordinates of 
the microparticle by Q and assume that the energy of interaction of the 
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microparticle with the electron has the form of a 6 function 
W = go(x — Q), (12.2) 

where g is the interaction constant. Let the excitation energy of the atom 
be £= £, — £o, and the energy of the microparticle be E=p?/2m where m is 
the mass of the particle, p its momentum before the interaction and p’ its 
momentum after the interaction with the atom. We assume that E>e. 
Therefore the microparticle is the main source of energy which is nec- 
essary for its localizing. The change in excitation Ey) E,, which is local- 
ized exactly in the region around the atom plays the part of the analyzer A 
which sorts the microparticles according to their coordinates Q to an ac- 
curacy of a, the atomic dimension. In further excitations the atom can 
initiate hidden transformations in the sensitive grain of photoemulsion. 
These, or similar processes, play the part of the detector D which destroys 
the interference of the various states of the microparticles. We shall not 
go into details on the design of detector, and limit ourselves to the design 
of the primary function of the measuring device, that of analyzing 
particles by coordinates. In our case, the macroscopic nature of the ana- 
lyzer A is expressed in the assumption that the mass of the atoms in the 
photographic plate is infinitely large so that their location in space may 
be defined exactly. 

At the initial time t=0, the wave function of this system (microparticle 
p and analyzer A) has the form 


Bo (x, Q) = Wo (x) e22, (12.3) 


This function describes the atom in the unexcited state and the micro- 
particle in the state of a plane wave having momentum p. For t>0, we 
write the wave function in the system as 


® (x, Q, t) = By (x, Q) el? + G(x, Q, t), (12.4) 


where the frequency @)=(E+€ )/h. We shall consider the function 
v(x, Q, t) as small and compute its contribution using perturbation 
theory. According to this theory (see, e.g., [28], Chapter XI), 


t 
1 i i 
7 | af 2 ih | dre'®™ (np' |W] Op) yn (0) 2, 
n o0 (12.5) 


GEOMETRICAL MEASUREMENTS IN THE MICROWORLD 55 


where (np’|W| Op) is the matrix element of the energy of interaction, 
taken between the initial state p(x, Q) (12.3) and the excited state 


®, (x, Q) = Wr, (x) el”. (12.3’) 


This matrix element is equal to 
(np' |W| Op) = fo: (x, Q) W (x, Q) Pò (x, Q) dx dQ 
= of ®, (Q, Q) Po (Q, Q) dQ 
= a (Q) P= dO = ghno(Q)s q=p- P. 
(12.6) 


Here p o(Q)=Y, (Q) Y (Q) and p,o(q) is the Fourier component of 
Pao (Q). Further, Q, is the frequency of the excitation 
ae (p?/2M + £o — p’?/2M — e) 
n h . 
If we consider small t+: 0<t<1/Q,, then 





p(x, 0, t) = > | dp’ = Bno (q) ÈP y) t 
= »4 Wn (20) Pro (Q) ÀL £. (12.7) 


In order to make this result clearer, we assume that in the sum of excited 
states only one term is important and for the others pmo %0 where m#n. 
Then we have 


P(x, Q, D = 1 U(X) Pro (Q) È (12.8) 


From this we may easily see that, for small t, soon after the interaction 
the wave function of the microparticle is 


Yn (Q) = Pno (Q) e'?? (12.9) 


and accurately reproduces the transitional density of the electron in the 
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atom p„o (x). This function is the wave packet with a probability distribu- 
tion for the coordinates of the microparticle 


W, (Q) = |Pno (Q)? (12.10) 


and with an average momentum of 
ô 
p= ii | Qa ,(Q) dQ ~ |p|. (12.11) 


(where the symbol ~ indicates that equality is satisfied to an accuracy 
of the electron momentum, which has the order of magnitude of h/a <p). 

In this manner, the act of exciting an atom is accompanied by the 
localization of the microparticle in the wave packet (12.9), and so the 
excitation of the atom can be considered as the transition of the particle 
from the state of a plane wave e!?” for t=0 to a localized state w,(Q) 
which is defined in (12.9) for 0<t«1/Q,. For t-00, the wave packet 
diffuses. However, if the excitation of the atom initiates a chemical or 
some other reaction, then it must be included in the examination of the 
chain of events leading up to the macroscopic event. In this case the state 
will not tend to a diffused packet for too, but will be described by a 
new state which results from the process of atomic excitation followed by 
the chemical reaction. 


B. Indirect Coordinate Measurements of a Microparticle 


We shall now turn to the indirect measurements of the microparticle 
coordinates. We first consider ray theory which we shall use as a probe for 
measuring the coordinates of a microparticle that is in a bound state, as 
for example an interatomic electron. 

A screen 3—3 with aperture D are shown in Figure 13. The aperture 
forms a packet of microparticles that have an energy E=p?/2m=hw and 
momentum p=hk. Let us superimpose a plane wave that has a certain 
total energy but a different direction of propagation. It has the form 


y= fa (a) exp {ikr [cos v cose + sin v sina cos(g — £)] 
— iat} sina da dB, (12.12) 


where r, v, ọ are the polar coordinates of the radius vector r and a, f are 
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the polar angles of the wave vector k. Integrating over the angle df gives 


y= [4 (a) sin a da exp {ikr cos v cosa} Jo (kr sin v sin a) 
(12.13) 








Fig. 13. Diffraction of a narrow beam. 3-3 is an absorbing screen; D is an aperture with 
diameter a; A, is the width of the beam at a distance r from the screen; 4? =a? +(A?/a? r°. 


[Jo (x) is the zero-order Bessel function]. For v=7/2 we obtain the value 
of the field at the surface of the screen. This field must vanish outside the 
aperture. We can set f(r) = B exp(—r?/a’) in a simple approximation to 
express the opacity of the screen for r >a. In the integral (12.13) only small 
angles give a significant contribution and, therefore, we may approximate 
the integral by 


foe) 


y= o | A(a) a da exp(—} ikr a?) Jo(krsinva). (12.14) 


(0 
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For z=0 we have 


Be"? = | Jo (kra) A(a) ada, (12.15) 
o 


from which t 


223 tee 
A(a) = Bee exp (- sia ). (12.16) 








4n 4 


Substituting this expression for A(«) into (12.14) and using the inverted 
equation (12.15), for large r and small v we obtain the intensity of the 
wave in the beam ||? which is equal to 


2\2 2 
ll? = E ) ap -35 e). (12.17) 


From this equation it follows that a beam of particles originating from an 

aperture of radius r, spreads due to diffraction in such a way that the 

square of its transverse dimension p?=r76? is defined in the quantity 
727? 


4} =>. (12.18) 





If we consider the time evolution of this process assuming r= vt = pt/m, 
we find that 
h? 


A? = 
1 
ma? 


i (12.19) 





which is in agreement with the result we obtained earlier for the transverse 
dispersion of a wave packet. In order that the beam would form a good 
probe, its width 4, must be substantially smaller than the dimensions 
of the system d within which the location of the microparticle is to be 
determined. That is, taking into account the initial dimensions, we have 
the condition that 


Ar? 


a + z 
a 





< d? (12.20) 


t See [35] for an example. 
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or, for a<d, 


d r , 
Fi > = (12.20) 
These conditions are extremely rigid and if applied to some real object, 
they are practically unrealizable. In fact there is no way of forming a slit 
or any other device for producing a beam whose width a<d, if d is the 
atomic dimension. However the theory formally allows the slit to be of 
any width and thus is not internally inconsistent. Keeping this in mind, let 
us consider the interaction of a microparticles to be used as a probe to 
determine the location of the first bound particle. Let the mass of the first 
particle be m, its coordinate x, its eigenfunctions w,(x) and E, the 
corresponding energy level. These functions satisfy the equation 


f n(x) Wn (X) = Ena (X), (12.21) 


where Ĥ„(x) is the Hamiltonian operator describing the dynamics of the 
bound particle. Let the mass of the particles in the beam probe be m, 
their coordinates Q and the momentum operator Ê. Thus their Hamil- 
tonian, written more simply, becomes 
R p’ 
H,(Q)=—. (12.22) 
2u 
As in Section 11, we assume that the interaction of the m-particles and p- 
particles occurs at one point, i.e., 


W (x, Q) = gd(x — Q), (12.23) 


where g is the interaction constant. The wave function of the m-particles 
together with the beam-probe w(x, Q) satisfy Schrédinger’s equation 


[Ân (x) + A, (Q) + W(x — Q)] W(x, Q) = Ey (x, Q). (12.24) 
We write w(x, Q) in the form 
W(x, Q) = yo (X) uo (Q) + X y, (x) u,(Q), (12.25) 
n#0 
where yo (x) describes the state of the m-particles before the interaction, 


and uo (Q) describes the beam of u-particles together with the elastically 
scattered wave. Substituting this function into (12.24) and multiplying 
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scalarly by w*(x), we obtain the system of equations for the function 
Um (Q): 
f? 

F + (Em — | Um (Q) = — gyo (Q) Ya (Q) uo (Q), (12.26) 
We neglected the product W $40 Wn (X) u,(Q) as being of the second 
order of smallness (for small g). For elastic scattering which is described 
by uo (Q), we have according to (12.26) 
— 2ug 

h2 


where k? =2uE/h?°. Using well-known methods? we obtain 


(V? + k?) uo (Q) = lya (Q)? u2 (Q) = p(Q), (12.27) 





uo (Q) = po (Q) + [se — Q’) p(Q’) dQ’, (12.28) 


where the function go (Q) describes the initial narrow beam [see Equation 
(12.12)], and the function u)(Q) describes this beam together with the 
scattered wave. 6(Q—Q’) is the Green’s function of the homogeneous 
equation (12.27) 


— eiklo-o'l 
6 (Q — Q) = ——_.. 12.29 
4r |Q — Q' 
and the quantity p(Q) is equal to 
` 2 
p(Q) =- =F Yo (Q Po (Q). (12.30) 


If we consider large separations between the scattering regions |Q|=r—> 
— oo, then equation (12.28) reduces to the well-known result? 


1 eikr 
uo (Q) = po (Q) + PAEA A(q), (12.28’) 
nmr 
where the amplitude of the scattered wave A (q) is equal to 
A(q) = [este (Q) dQ; (12.31) 


t See, e.g., [28], Chapter 13. 
t See, e.g., [28]. 
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Here the vector q=p’—p is transmitted momentum due to scattering. 

From Equation (12.28) we can see that the source of the wave scatter- 
ing in the region the density p(Q) differs substantially from zero. This 
region is the region of intersection of the beam-probe of -particle 
@o(Q, #) with the region of localization of the m-particle. This region is 
defined by the behavior of the function |W) (Q)|?. 

Consequently, the region in which p (Q) differs substantially from zero 
has definite coordinates in the (x, y) plane to the accuracy of the beam 
width a and an undefined coordinate z which is the beam direction (see 
Figure 14). If we can now record the scattering of the y-particles off the 


7 : - p(Q) 























3 





Fig. 14. Coordinate measurements in an indirect experiment. 3-3 1s an absorbing screen; 
a ıs the diameter of the aperture which produces a narrow beam-probe. 
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atom with some detector, then we will have shown that at the moment of 
scattering, the m-particle had the coordinates x, y which are equal to the 
coordinates of the aperture that defined the beam of y-particles to an 
accuracy of the beam width 4x, Ay xa. We note however that this asser- 
tion would be exact only if the beam had sharp boundaries. In fact such 
boundaries do not exist and there is a finite probability that the p-particle 
is scattered on an m-particle that is located at a distance greater than a 
from the axis of the beam. Furthermore, because of the diffuseness of the 
boundary of the atom, the m-particle may be extremely far from the 
center of the atom, in the region where the beam-probe diffused to such 
an extent that its dimensions 4, >a. 

And so, even in this gedanken experiment in which we assumed a nar- 
row slit and an interaction at one point, the inference about the actual 
coordinates related to the m-particle atom can only be made in terms of 
probabilities. 

The question can be formulated more strictly in the following manner: 
Given a scattered y-particle, what is the probability that it was scattered 
by a m-particle located in some sharply defined region Q(x)? In answering 
this question we note that the quantity which defines the full probability 
of some elastic scattering of -particles off of bound m-particles is the 
integral 


I= [nor dq, (12.32) 


where A (q) is the amplitude of the scattered wave (12.31). The quantity 7 
(12.32) measures in relative units the flow of elastically scattered y-par- 
ticles. 

Using (12.31), this integral can be rewritten in the form 


1 
l=— | p(x’) p(x”) A(q, x’ — x”) d3x’ dx”, (12.33) 
(47) 
where the function 4 (q, x) (x =x’ — x”) is defined by the equation 


sin Q, x sin sin Q3z 
Aq, x) = Qı Qay Q3 ; (12.34) 
x y Z 
and Q,, Q2, Q3 are the maximum values of the projections of the vector 
q on the coordinate axes. From (12.33) we can see that the probability 
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for elastic scattering J cannot be represented as a sum of contributions 
from various spatial regions but is defined as the sum of correlations. 

However, if the wavelength / of the beam-probe is small in comparison 
to the region in which p(x) varies, then the limits of integration for qx, 
qy, q- can be changed to + œ. This is equivalent to exchanging the function 
A(q, x) for the function 6(x). Then in place of (12.33) we obtain 


1 2 3 
I sale (x) d°x (12.35) 


and the probability that the scattering took place in the region Q can be 
written in the form 


1(Q) 





| p? (x) dx. (12.36) 


R 


~ (4x) 


In these calculations we assumed that the interaction of the u-particles 
and m-particles occurred at one point [see (12.23)]. If it did not occur at 
one point then it is, in general, impossible to define the probability 7(Q) 
in regions Q which are smaller than the sphere of interactions. In fact, 
substituting 6(x—Q) for the distribution function of the interaction 
V (x—Q) in (12.23) gives 


W(x — Q) = gV (x — Q), (12.23’) 
and repeating the steps used to obtain Equation (12.27) we obtain 
(V? + k?) uo (Q) = py (Q), (12.27') 


in place of (12.27), where 


2 
Pvy (Q) = “ut | V (x — Q) Iho (I? po (Q) dex. (12.30) 


From this it follows that 


1 
Iy = a | pyx’ Py (x”) A(q, x — x") d3x’ d3x", (12.33) 


or for A-+0, |q] 00 we obtain, analogously to (12.35), 


1 
i= Gn? | pè (x) d3x. (12.34) 
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As before, we may formally introduce the quantity 


I, (Q) = | p2 (x) d?x. (12.35') 


Q 


1 
(4n)? 
However, from (12.30) we can see that the quantity p$ (x) is non-locally 
related to the function |W. (x)|’, so that it does contribute to the integral 
(12.35) even in the regions of space where the quantity |W. (x)|? is equal 
to zero. The interpretation of integral (12.35) can be retained in the case 
when |W9(x)|? is very evenly distributed inside the scale which defines 


the sphere of interaction, that is the region where interaction potential 
V (x—Q) is negligibly small. 


13. INDIRECT MEASUREMENT OF A MICROPARTICLE 
COORDINATES AT A GIVEN INSTANT OF TIME 


So far, the measurement of the coordinate x of the microparticle was 
considered for an undefined moment in time. In order to measure the 
coordinate x in some specific small interval of time At the function of the 
aperture in the gedanken experiment of forming a beam-probe necessarily 
becomes more complicated. We must assume that it is possible to open the 
aperture at some moment in time, say t=0, and to close it at some 
moment t= At. Then, in place of a beam shown in Figure 14, we obtain a 
packet, defined not only along axes Ox and Oy, but also in the direction 
of its motion, along the axis Oz. 

This packet will diffuse according to the equations obtained in Section 
11. Now, in place of a beam-probe of u-particles we shall have a packet- 
probe of u-particles. Such a wave packet-probe can be made up of a 
superposition of plane waves which diffuse primarily in the Oz direction 
(see Figure 14). In accordance with the above, we shall write the wave 
function of the y-particles which we shall use to probe the location of the 
m-particles which are bound inside the microsystem, in the form 


(Et — pQ)\ dp 
Po (Q, t) = fe (p) exp( - l h ) (2rhħ)?!? $ 





(13.1) 


where 
_ exp {— b? [pz — p; — (p. -PI 
= b3/273/4 . 


C(p) (13.2) 
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Here p is the average momentum in the direction of motion of the packet 
along the Oz axis, and the quantity 1 /b? is a measure of the rms deviation 
of the momentum 4P}, AP?, AP?. Using the calculations described in 
Section 11, we find that for sufficiently large values of the parameter b, 
the function po (Q, ft) may be written in the form 








e (@? —ké) 


Po (Q, t) = Grn I (Q, 4), (13.3) 


where the function f (Q, #) is equal to 


se L 
a a | 








f(Q = exp] (13.4) 
In these equations w=E/h; E=p?/2u is the average energy of the u- 
particles; P is their average momentum directed along the O€ axis, k= p/h 
is the wave vector, v=p/p is the average velocity along the O€ axis and 
Aj, 4j are the average widths of the wave packet in the transverse direc- 
tion and that corresponding to the direction of the y-packet. In agreement 
with (11.22) we have 


2 


h 
Doe Bde we 2 
Ai = Aj =a E 


t?, (13.5) 
where a? ~h?/b?. As in Section 12, let the Hamiltonian of the m-particle 
be H,, (x), and the Hamiltonian of the y-particles be Ê, (Q) [see Equations 
(12.21) and (12.22)]. Then the Schrédinger equation for the wave func- 
tion ¥ (x, Q, ż) of the system of “‘m-particles plus the wave packet of 
p-particles” may be written as 


he (A, (x) + A, (Q) + W(x —Q)] ¥, (13.6) 
where W is the energy of interaction of the m-particles and p-particles. 
We shall consider that this interaction occurs at one point [see Equations 
(12.23)]. We now write the function w(x, Q, A in the form 


Y = (x, Q, t) exp( = i 2), (13.7) 


where E, is the energy of m-particles in the state Yọ(x). The function 
®(x, Q, £) may be written as the sum 


S (x, Q, t) = Wo (x) Po (Q, t) + u(x, Q, t), (13.8) 
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where u(x, Q, £) is the scattered wave, and %o(x) go(Q, £) is the initial 
wave. Let us further write w(x, Q, t) as the series 


u(x, Q, 1) = ¥ um (Q, OF, (x) (13.9) 


in the eigenfunctions w,,(x) of the Hamiltonian operator Ĥ„ of the 
m-particles. In the usual manner (cf. Section 12) we find the system of 
equations for the function u, 

Gu, P? 


ih ap = zp Ym + Io (Œ YAO Po (Q 0) (13.10) 


which are analogous to the system (12.26). In particular, for elastic 
scattering, we obtain 
ay 0: È 2 
ih | — — — } uo = g Wo (Q)|4 Po (Q, t). (13.11) 
ôt 2u 


In finding the solution to this equation, we note that the Green’s function 
© of the homogeneous Equation (13.11) is 


a P , , 
ih | —-— — )G = — ô (Q —- Q) ô(t - r’), (13.12) 
ôt 2u 
and has the well-known solution [29,36] 
1 i (Q — or) 
= ——> — -= } for t>fť, 13.13 

t= t)’ exp( h2(t— ty? = ( ) 

6G =0 for t<rť. (13.13) 
Thus the solution for the non-homogeneous Equation (13.11) can be 
written in the form 


œ 


= — p\- 3/2 =~ im(Q a or 
uo (Q. t) = Go (Q, t) vafe t’) exp( io Gah. 
x Wo (Q)? Go (Q, t) BQ! dr’, (13.14) 


where the coefficient « contains all constant factors. 
The function u,(Q, £) describes elastic scattering of u-particles. We 
shall now consider the case when the dimensions of the p-packet, 4, 


t In doing this we again assume that the interaction constant g is small. 
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longitudinal and A, transverse, are large in comparison to the funda- 
mental wavelength 74=h/p. In this case the variation of the index of the 
exponential of the function f(Q, £) (13.4) in the period of the variations 
xh/E is very small, and we can perform the integration over dt’ in 
(13.4), considering the function po (Q, #) to depend on #’ only through the 
factor e~'*"’/" =e" Introducing the symbol t= ¢' —t in (13.4), we apply 
the method of steepest descent to the index of the exponent in (13.4). The 
integral that we must take has the form 


oO 


— iF (t) 
0 
where 
A? 
F(t) = 75. + or (13.16) 


and 4D=2h/u. A* =|Q’ —Q|?. Equating dF(t)/dt to zero, we obtain 


o Ae ce Ie = Ol Qi 
Tm =+ iDo T 7 (13.17) 


The minus sign corresponds to the retarded, and the plus sign to the ad- 
vanced action. This relationship shows that the scattered wave travels 
from the point Q’ to the point Q in a time tm, which is defined by the 
velocity of the u-particle v=ħk/u. Noting that d?F(t)/dt? is equal to 


2 3 


v 
F” EEES ee ne i 
(n= 558 = tio T (13.18) 
we find that 
etikie-al 
(2p m (13.19 
PQQ 


where f is a constant coefficient. Returning now to Equation (13.14) we 
obtain 


elk lQ-Q' —iot 
IQ — Q'I 
Q- 2-2) ag, (13.20) 


uo (Q, t) = Po (Q, t) + xf 
x p(@t- 
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where «’ is some new coefficient, and the function p(Q, £) is equal to 


2 ; 
P(Q, D =F bo (OP Po (Q, 1) e (13.21) 


It differs from the earlier result (12.28) only in the time dependence 
of the wave packet @,(Q, 2). This dependence allows us to speak of that 
moment in time when, by the act of scattering, the y-particle measures 
the coordinates of the m-particle. 

If we consider larger separations as we did in Section 12, i.e., 
\Q| =r oo, then from (13.20) we obtain 


uo (Q, 1) = Po (Q, ) + Ala Q 0), (13.22) 
where the amplitude of the scattered wave A (q, Q, 2) is equal to 

A(q, Q, t) = f ep (o. t— 2-2) d3Q’ (13.23) 
and differs from (12.31) by the retarding time dependence. 


We now calculate the full intensity 7 by integrating |A (q, Q, £)|? over 
d?q, assuming very short waves (4-0), and instead of (12.36) we obtain 





I(Q, Q, t) = ; ip? [e (a. t- 122) d3Q’. (13.24) 
Q 


This equation differs from (12.36) in that the moment of time t' when the 
scattering takes place is ¢’=1—|Q’—Q]/v. Here Q is the radius vector 
of the point of observation, and Q’ is the radius vector of the region Q 
inside of which the scattering took place. 

The quantity /(Q, Q, f) is a measure of the probability that the scat- 
tering of a u-particle observed at a point in the moment t, originated 
as a result of scattering in the region Q in the moment of time t= 


=1—-|Q—-Q'|/v. 


CHAPTER III 


GEOMETRICAL MEASUREMENTS IN THE 
MICROWORLD IN THE RELATIVISTIC CASE 


14. THE FERMION FIELD 


A field of free fermions can be described by a four-component spinor 
operator ý (x). This operator and its conjugate (x) satisfy Dirac’s equa- 
tions 


ô a 
(+m) a0, (14.1) 
= ê 
Y tn gem) =03 (14.1/) 


x,(u=1, 2, 3,4) are the components of the four-dimensional radius 
vector of an arbitrary point P. They are defined by x=(x,, x2, x3), 
X4=Ict, where, as usual, ¢ is present time, c is the velocity of light, and 
y, is the well-known Dirac matrix 


Yay F Vw? pn ET 205, d (14.2) 


The components of the spinor w and # satisfy the laws of commutation 


e) be(x)} =0, {he p(x} = 0, (14.3) 
h(x), Up (x)} = iSyp (x — x’), (14.3') 
a, B = 1, 2, 3,4, 


where the brackets {4B} denote the anticommutation AB+ BA. 
The function S,,(x— x’) is the matrix element of the commutation 
function S(x— x’), and is equal tot 


ô 
Sip (x — x’) = — (>, zT m) D(x — x’), (14.4) 
u ap 


t See Appendix 1, Equations (I.3) and (1.15). 
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where 








D(x) = 1 em sin Et d3 
= nP L p. (14.5) 
In this equation p is the momentum vector and E is the energy which 
is equal to E = +(p?+m?’)!/?, 

If the plane waves e?” (px = px — Er?) are taken as the basis functions 
for expansion, then the spinors ý and f may be written in the form 


(14.6) 
ve Je {âr i (p) e~'* + b, (p) D (— p) e*™}. 


r=1, 


(14.6'/) 

Here u” (p), v’(—p) are the spinor amplitudes of the plane waves. They 
satisfy Dirac’s equations 

(ip + m) u(p) = 0, (14.7) 

(— ip+m)v(—- p)=0 (14.7) 

[the operators p=y,p, and p, = (p, E/c)] and are normalized by the equa- 


tions 
4 


D uz (p) u (p)* = ô, (14.8) 


a=1 
4 
2 vh(— p) og (— p)* =ô, (14.8’) 


The four-component spinor w (p) e?” denotes waves that describe nega- 
tively charged fermions (electrons) with momentum p, energy E= +(p? + 
+m’)!’? and with orientation spinors defined by the index r(=1, 2). The 
wave v'(—p)e '?* describes positively charged fermions (positrons) 
whose momentum is p, energy is E= + (p° +m°)"? and whose spinor is r. 
The operator properties of the spinors ŷ and w carried over to the 
amplitudes â, â+, 6, b+ which are the operators that satisfy the anticom- 
mutation conditions 


{â, (p), ay (p’)} = 6,0° (p A p’) š (14.9) 
{5, (P), 5, P} = 6,63 (P — P’); (14.9’) 
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All the remaining anticommutator operators, â, â+, 6, Ê+ are equal to 
zero. 

The laws of commutation ensure that the laws of commutation for 
operators w(x) and W(x), (14.3), (14.3’) will be satisfied. The physical 
meaning of the operators â, â*, b, &* is the following: the operator 
a,(p) which acts on the functional ¥ which describes the state of a 
quantized fermion field, decreases the number of electrons #,(p) in the 


state (p, r) by unity 


â, (p) ¥(..., n, (p), -..) = ¥ (..., n, (p) — 1, ...), (14.10) 
and the operator â; (p) generates an electron in the state (p, r) 
ât (p) ¥(..., n, (p), ...) = P (..., n, (p) + 1, ...). (14.10) 
The operators 5, ô* have the corresponding value for positrons 
b, (p) ¥(..., nF (P), ...) = %(.., 0, (p) — 1, ...), (14.10”) 
bt (p) Y (..., nt (p), ..) = P (..., 07 (P) + 1, ...). (14.10”) 


We note that according to Fermi-Dirac statistics, the population of a 
state n, (p) is limited to 0 or 1.t 

The law of conservation of electric current follows from Equations 
(14.1) and (14.1) 


ad, 


= =0, s 
- (1411) 


The components of this current are defined by the equation 
J, = ieby,b, (14.12) 
where e is the electron charge. The fourth component of this current 


Ja = ip = iepy, (14.13) 
can have either sign depending on whether the charge density of electrons 
is greater or less than that of positrons at a given point. 

However, if we consider a single particle state, then the quantity p has 
a definite sign and if we divide it by the charge of a particle e, we can con- 


+ The details of quantization of a Dirac field may be found in books on quantum theory 
of fields. Here we are following the notation used in [37, 38}. 
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sider it as the probability density w(x) of finding a particle in a place x 
at a given time f. 

To prove this important fact, let us consider a single-particle state, for 
example an electron. In general this state will have an undefined momen- 
tum p and an undefined spin orientation r. The functional that de- 
scribes such a state has the form 


Y(1)= F, |e- I] ôl) dp". (14.14) 


On the basis of (14.6), (14.10)-(14.10”’) and (14.11), the matrix element 
of the operator of a fermion field w that corresponds to the transition 
from the vacuum state ¥ (0) to a state ¥(1) is equal to 


W(x) = <P Ol W(x) IY (1) = fe (p) d°p u” (p) e'?* (14.15) 


It describes the wave packet of a fermion with a spin orientation r, with 
momenta defined by the amplitude c(p) and a positive energy E= + 


+(p?+m?)’?. 
A normalization of the functional of the field 
< (1), P (1) =1, (14.16) 


leads to the condition normalizing the amplitude c (p), 


Je (p) c(p) dp = 1. (14.17) 


This matrix element of the field operator f(x) (14.15) has the meaning 
of the electron wave function w(x) in Dirac’s one-electron theory. 

Let us now calculate the average value of the fourth component of 
the current J4, or, as it is more convenient, the quantity o=J,/ie in a 
single particle state y (1) (14.11). From (14.13) and (14.15) we have 


w(x) = <P (D| wpa IY (D> 
=J) yy = poya = WE (x) Wax) > 


(14.18) 
The positive definiteness of the quantity w(x) allows us to consider it as 
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the probability density for single particle states. By use of the conserva- 
tion law (14.11), the total probability 


= (4, 4) = | tras dex = fow d°x (14.19) 
remains constant in time namely, 
axa [divg d'=- | Jydo=0. (14.20) 


In conclusion, we note that the basic functions 
exp i (px — Et) 


Yp, r(x, a) = (2n)3/? us (p), (14.21) 
i(px — E 

Wp, (x, a) = mepe va(p) (14.21') 

(r = 1, 2), 


which appear in Equations (14.6) and (14.6’) form a complete system 
of functions. The condition of completeness for t’=7 gives 


es 2 d?’pyž (x, a) Yp, (x, B) = È (x — x’) dyg. (14.22) 


where the sum is taken over both spin orientations r=1,2 and both 
signs for energy (spinors u and v). If we limit ourselves to using only one 
of the energy signs, for example E > 0, and correspondingly to one spinor 
u, then in place of (14.22) we obtain 


26 
2 dwt (x, a) Wp, r(x, a) = — i (Y4Sa (x — x’), (14.23) 


where 


je- (nzm) a2) (14.24) 
OX, 
and 


otis Me ee ag 
A w= ais | OE d°p. (14.25) 


From this we can see the 6?(x—x’) cannot be expanded in terms of a 
fundamental system of functions y, ,(x, «) if we limit ourselves to states 
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with only positive frequencies (electrons) or only negative frequencies 
(positrons). 

This fact, that the function Sj, (x — x’) is spread in the region Ax ~h/mc, 
reminds us that Dirac particles cannot in general be localized more closely 
than within limits of its Compton wavelength. In the following section 
we shall consider this problem in detail. 


15. THE UNCERTAINTY RELATION FOR FERMIONS 


We consider the single particle state ¥ (1). For such a state, we can use 
the ordinary, non-quantized Dirac equation for one fermion with the 
important restriction that only one sign for the energy state E= +(p?+ 
+m?)'/2>0 be used. 

The wave function that describes such a state has the form 


2 dp l 
W (x, t) = x a c, (p) u” (p) e'?*. (15.1) 


We shall consider it to be quadratically integrable. We calculate the rms 
deviation of any coordinate, e.g., z, in this state at some instant in time. 

Using no restriction other than a choice of the frame of reference, we 
can assume that the average value of Z=0, t=0 and the average momen- 
tum p is also equal to zero. For simplicity it is sufficient to consider 
the state with one specific spin orientation r=1 or r=2. In the selected 
frame of reference, the value of the rms deviation Az? = (z — Z)? =z? may, 
using (14.18), be written in the form 


Az? = fos t) z? d?°x = |v (x, t) z7W (x, t) d3x (15.2) 
Normalization gives 


f w(x, t) d?x = 1. (15.3) 





The expression (15.2) can be transformed using the relationship 


GeiPx 





d?p 
2 (x) = “1 age amie) up 


d°p ô[c(p) “ol nes 
(2x)? Op. 





(15.4) 
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Similarly we may obtain the product zy*(x) and by carrying out the 
integration over d*x, we find that 


Gz [er ô [c (p) u (P)] ô [c (p) u(P)] 


op, op, 
Ea ô 2 ðu |? 
iaa |a ol lu? + | d®ple(pyl? = 
Op, op, 


| pa ôu ôc ôu dc* a 
+ | d3p c—ut+c* ut cu* ; 
Op, OD, Op, Op, Op, — OP, 
(15.5’) 
As we shall see later, the first integral on the right leads to the un- 
certainty relation; the second two integrals are characteristic of the 
theory of relativity and require special investigation. 
In the frame of reference we selected we can, without any important 
restrictions, consider that the amplitude c(p) has the form 


Ipi\ 1 
c(p) = (2 aes (15.6) 
Po/ Po” 
where the function f (|p|/|o|) is real and decreases sufficiently rapidly for 


Ip|/Po > 1. This indicates that the quantity po characterizes the dispersion 





(15.5) 


or 

















momentum 4p? in the state we are considering. In fact, 


sr a d°p Ipl 
Ar} = (P: — Pa)? == (EP pae Pe 
Po p 


=p | čt dé cos? 9 dQ f? (8) = «'på. 
(15.7) 


where p, =|p| cos 9, dQ=sin 9 d9 dg, €=|p|/po, and «’ is a coefficient of 
order 1. 

For a real amplitude c(p), Eq. (15.5’) becomes simplified because of the 
normalization of the spinor u* u = 1 [see (14.8)]. Using this normalization, 
we obtain 

dz = | d°p (* 2) + fer 2” 
op, Op, 


2 





(15.8) 
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We now introduce the variable č = |p|/po in place of |p| and rewrite the 
first integral in (15.8) in the form 


1 4n ( (df\? a” 

I = — — —— 2 d = 15.9 

4 | (OY eat a 

0 
(the coefficient a” is of the order 1) and the second integral in the form 

1 4x 

L= ey ar (¢) č dčM z), (15.10) 
Po 3 Po 


where the function M (é, m/po) is defined by the integral 


ĝu* ðu 
d2. 15.11 
m (z ae al Op. OP, vos 


This function is calculated in Appendix II and is equal to 








3 2 
ore for é<”, (15.12) 
m 4m Po 
Mee) 2 2 
Pp 3 
mo EO BE ee E (15.12) 
4m* po Po 
From the properties of the function M (č, m/po) we have 
Zg for — 41, (15.13) 
m Po 
h= 4n m 
— $” for —>l, (15.13’) 
Po Po 


where f' and $” are coefficients of order 1. 
Returning now to Eq. (15.8) and reducing it to a form that shows its 
dimensions more clearly, we obtain 





= h? z fo. 

Az? =a — +B — for Ap? < mc’, (15.14) 
Ap? mec 

— h2 PRES 

Az* = y — for Ap? > m?c?. (15.14’) 
Ap: 


From these equations in which «, 8, y are numerical coefficients of order 
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1, we see that the additional relativistic terms which result from the 
dependence of a spinor momentum are not important because from 
(15.14) the additional term is small in comparison to the first, and for 
Ap2>m?c?, Equation (15.14’) should be used which differs from the 
term in (15.14). From (15.14) it follows that, for Ap? 00, the magnitude 
of the variance of the coordinates 4z? may be made arbitrarily small. 

Thus, although the function 6(x—x’) is not in the class of functions 
that are allowed for the single-particle state, the theory nevertheless al- 
lows the possibility of localizing to any degree of accuracy the single- 
particle state with a specific sign for the energy that is in the class of 
quadratically integrable functions.* 





16. THE BOSON FIELD 


The field of spinless bosons is described by the scalar or pseudoscalar 
operator @(x) which has specific symmetry properties in an isotopic 
(charged) space. 

We shall consider a complex scalar field which describes charged 
particles. Such a field satisfies Klein’s equation Ë 


(O° -m)ĝ =0, (16.1) 

(O? — m?) @* =0, (16.1) 
and the commutation laws 

[o (x), p* (x')] = iD (x — x’), (16.2) 


where [A, B]= AB—BA is the ordinary commutator, and the function 
D(x—x’) is defined by Equation (14.5): 


1 -sin ot 
D(x) = — | a ee ax: 163 
0) = Gps | Peg nee) 


Here k is the momentum of the boson, œw is its energy in the system of 
identity elements where h=1. The remaining commutators 


Lo (x). ô N = [6* (0), 6* (x) = 0. (16.2) 


t In this section we followed the work of [30]. 
t See [37, 38]. 
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The boson field @(x), in analogy to the fermion field, can be written in 
the form of an expansion of a fundamental system of orthogonal func- 
tions 


ôx) = fon {a (k) U (k) = + 5* (k) U*(k) e ™}, (16.4) 


ĝ* (x)= | d°k {a* (k) U* (k) e~* + b(— k) U (— ke}, 
(16.4’) 
where 
1 


U(k) = F (16.5) 


w= +(k?+m7)!/?, m is the boson mass. The operators â, d*, b, 6* have 
meanings analogous to the operators in Equations (14.6) and (14.6’) for 
the fermion field. That is, they satisfy the commutation laws 


[á(k), a* (k)] = 6° (k — k’), (16.6) 
[5 (k), 5* (k)] = 6° (k — k’); (16.6’) 
and the remaining commutators, [4, â'], [4*, â+], [6, 6), (6, 8+1, [4, 8%], 
are equal to zero. The physical meanings of the operators â, d*, b, b+ are 
the following: The operator â(k) operates on the functional ¥ which de- 
scribes the state of a boson in a quantized field, and it decreases the num- 


ber of positively charged bosons n* (k) in the state with momentum k and 
energy w= + (k? +m’)! to unity: 


a(k) ¥(...,n* (k),...) = Y (..., n* (k) — 1,...); (16.7) 


The operator â* (k) increases the number of positively charged particles 
in the state (k, œ): 


a* (k) ¥(...,n* (k),...) = P (..., nt (k) + 1, ...). (16.7’) 


The operators b(k) and Ê+ (k) have the same meanings for negatively 
charged bosons: 


Dk) W(..., n7 (k),...) = Y (..., n7 (k) — 1,...), (16.8) 
5+ (k) Y (..., n7 (k),...) = P (..., n7 (k) + 1,...). (16.8) 
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From Equations (16.1) and (16.1’) we obtain the conservation law 


ô, 
-= =0, (16.9) 
ôx, 

where 
~ . f, 00 ôĝ* 
J,= *— — ĝ?, 16.10 
icf Soh (16.10) 


The fourth component has the meaning of an operator of the electric 
charge density 


J ax 
nmi miedo SE gh. (16.11) 

However, in contrast to the fermion case, we shall now see from the 
following that one cannot ascribe a positive-definite probability density 
«(x)= p/e to bosons, nor even to single-particle states with a specific 
sign for the energy. We shall now consider such a state. Let ¥ (1) be a 
functional of the field which describes the single particle state with an 
undefined boson momentum k and with a defined sign for the energy 
@>0. This functional may be written in the form, absolutely analogous 
to that for the fermion field (14.14), 


P(1) = [amen 5(n* (k) — 1) 
x ô [n (k)] TI 6[n* ky] 6 [n (ky). (16.12) 
k#k’ 


The matrix element of the operator of a boson field corresponding to 
the transition from the vacuum state ¥ (0) to the state ¥ (1), which is 
described by (16.12), may be calculated from Equations (16.4), (16.4’), 
(16.7)-(16.8’) and is equal to 


o (x) =<¥ (0)| (x) lP (1) = faw U (k) e** d3k (16.13) 


It describes the wave packet of bosons which have a momentum whose 
amplitude is A (k) and which have a positive energy w= +(k?+m7?)!?. 
Normalizing the functional 


<¥ (1), ¥(1)> = 1 (16.14) 
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leads to the normalizing condition for the amplitude A (k) 
[ew A(k)d’%k=1. (16.15) 


This matrix element ọ (x) of the operator of the field @(x) (16.13) that 
we have considered has the meaning of a boson wave function in the 
single-particle theory in a scalar field. 

We shall now compute the average value of the fourth component of 
the current J, or, as it is more convenient, the quantity w(x)=J,/ie in 
a single particle state ¥ (1) (16.12). From (16.11) and (16.13) we obtain 


ô 0o* 
ce" aliwa 


w(x) = CP (Li (o a 


: ô 6p* 
= (o w = W 940), (16.16) 


where (x) is defined by Equation (16.13), the integration over d?k is 
taken for only positive energies and kx =kx — wt, œ = + (k? +m’)'/?. The 
quantity W is defined as 


0g dg* 
W=(9,W= dex =i *— _ —_  } d°x, 
(9, Y) fro x f(e AET o) x 
(16.17) 
and is conserved due to the continuity equation (16.9): 
dw E ôw 


—_ = aren — | divs ax | Jy do =0. (16.18) 
dt ôt 


The generalized quantity 


ôy ðọ* 
(9, W) =i |(o" 7 = = v) d3x (16.17’) 


may be considered as the scalar product of two single-particle functions 
ọ and y. If we take for these two functions the basic solution 
eilkx — ot) 


= Gap Ve 


(x) (16.19) 


then they describe a complete system of functions. The condition of 
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completeness for t=?’ gives 








[ot (x) p(x’) d?k = A* (x — x’), (16.20) 
where the function 4* (x) is equal to 
Ji j— Ph a 
= : 16.21 
ome eee) 


Because of this fact, the function 53(x—x’) which corresponds to the 
“exact” localization of bosons does not belong to the class of functions 
represented by the expansion (16.13). Using the fact that the single- 
particle wave function (x), which has a certain sign for energy, satisfies 
the equations 


ĉo 
i— = o@, 16.22 
i ôt ( ) 
— * 
j = A > 16. k 
i 7 Op (16.22') 


where the operator w’ is equal to 


=+, m- V’; (16.23) 


and using Eq. (16.11), the density w(x) may be written in the form 
w(x) = 3(@0*@ + p*ôọ). (16.24) 


From (16.24) we may see that, by itself, it does not guarantee positive 
definitess. In Appendix III we give an example of a state of a boson field 
which is written in the form of the superposition of two waves g, and 
@, which have distinctly different momenta k, and k, and whose energies 
have the same sign (w,>,>0): 
c c 

(x)= 9, +—— o. (16.25) 
Vo 2o, 
In this case 


le,|? c 
w(x, t) x ——41 + 2 
w; Cy 








cos [(k, — k,, x) —(@, —@,)t+ a. 
(16.26) 
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and if |c,/c,|>1, then this density oscillates in space with a period that 
is of the order of 1/|k,—k,| and in time with period 1/|@,—a@,|. 

The energy density of this field e(x) is both physically observable and 
a positive-definite quantity in the case of a boson field. The operator of 
this quantity (x) may be derived from the energy-momentum tensor of 
the boson field 


Pop 4. Jae* 06 
aus Rye : 
Tin (%) = oe ôx, 2 (FS ôx, E R Ue) 








From this we obtain 


A dt 66 
— T. VOT V6 Pte]. 
TERR Tga 2 E L VGV + mG "o) 
(16.28) 
Therefore the average value of e(x) in the state ¥ (1) is 
e(x) = <P (1)| ê (x) [F (1)> 
0g* ð 
(E Z2 + vot Vo + mgro) >0, (16.29) 


17. THE LOCALIZATION OF PHOTONS 


The electromagnetic field belongs to the class of boson fields, but unlike 
the scalar meson fields, it is described by a vector operator — the vector 
potential operator Â, (x), where p=1, 2, 3,4 with components A(A,, 
A), Â), ĝ=Ã4li. 

This potential is defined to within the accuracy of a gauge transforma- 
tion 


o. 


Xu 


(17.1) 





Â, (x) = Â, (x) + 


where f (x) is an arbitrary function that satisfies d’Alembert’s equation 
Of (x) = 0. (17.1) 


Because of this arbitrariness, the characteristic values of the vector poten- 
tial are not physically measureable quantities. The measureable quantities 
are the components of the antisymmetric tensor of the electromagnetic 
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field 
F= ee ak (17.2) 
ôx, OX, 
or, in vector notation 
$--- > v, (17.2 
H = rotA, (17.2") 


where @& is the electric field potential operator, and W is the magnetic 
field potential operator. 

A distinctive feature of a free electromagnetic field is its transverse 
character. Therefore only three of the four components of the vector 4, 
are independent. This places a restriction on the choice of the wave func- 


tional ¥ 
(y 


If this condition is satisfied at the initial instant in time, then it is possible 
to show that it will be satisfied for all time from the equation of motion.* 
When the condition (17.3) is satisfied, then the equation for the vector 
potential of the electromagnetic field becomes 


a, 


= 3 
Ay r) 0. (17.3) 








u 





Eat, =0. (17.4) 
The commutation laws for the operator A may be written in the form 
[A,,(x), A, (x’)] = — iD (x — x’) ôv, (17.5) 
For m=0, D(x) has the form 
i . sinwt 
D(x) = ikx 3k, 17.6 
0) = oop | We aay da 


where w= |k|. The expansion of the vector potential in a fundamental 
system of functions has the form 


4 
Â, (x) = X dĉk eż (A, (k) U (k) eikx + A} (k) U* (k) e7], 
a=1 
(17.7) 


t For details on the quantization of the electromagnetic field, see [37] or [38]. 
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where U(k)=1/(2)'/”, kx =kx— wt, e} are the components of the unit 
vector which define the polarization and A,(k) and 4} (k) are the opera- 
tors of the amplitude of partial waves. We note that all four polarizations 
appear in this expansion; two are transverse, one longitudinal and one of 
time. Using Eqs. (17.2) and (17.2’) for the observed quantities Ê and 
Z with condition (17.3) satisfied, we obtain the expansions 


2 
8 (x) = y f d?k œ (Âi (k) UL (x) — Az (k) UF (x) e, 
A=1 


(17.8) 


2 
H (x) = D f dk [ke*] (4, (k) U, (x) — Az (k) UF (x), 
A=1 


(17.8’) 


Only the two transverse polarizations for A=1, 2 appear in these equa- 
tions. From the commutation law (17.5) we obtain the commutation con- 
dition for the amplitude A,(k) 


[A, (k), Aj (k’)] = ô’ (k = k’) Oya (17.9) 
[A, (k), Ax (k’)] = [Az (k), £x (k’)] = 0. (17.9") 


The only physically observable quantity which satisfies the conservation 
laws, in the case of a vector field, is the energy-momentum field tensor 
T,, (x). The operator which represents this tensor has the form 


T (0) =F. F ip — F OP ons (17.10) 
from which we have 
Ty (x) = — 3 (8? + R), 
Tha (x) = —i@ x #), = — if}. (17.11) 
The conservation law for the tensor 7,,,(x) states that 
OF i) y; (17.12) 
Ox, 


and, in particular, for the energy-density operator 


ex) = — Ty, (x) (17.13) 
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we obtain from (17.12) 
08 (x) 
t 





+ div §(x) = 0, (17.14) 


where S(x) is the vector energy flow operator whose components are 
S (x) =iT4,(x). We shall now return to the single photon state with an 
undefined momentum k. 

For this case we note that, by virtue of the commutation laws (17.9) 
and (17.9), the amplitude operators Â, (k), A+ (k) operate on the wave 
functional ¥ in a manner identical to that of the scalar field operator that 
we discussed in Section 16 [see Eqs. (16.7), (16.7’)]. For a single photon 
state, this wave functional has the form 


¥(1) -faw dèk 5(n,(k)-— 1) [| [I ôDa k], 
a=1,2 k Fk 
(17.15) 
where A, (k) is an arbitrary amplitude of the wave whose wave vector is k 
and polarization is 4 (A=1, 2). 
Using the law of operation of the operators A, (k) 4} (k) on the wave 
functional [the laws are expressed by equations which correspond exactly 


to Equations (16.7) and (16.7’) for scalar field operators d(k) and 4* (k)], 
we find that the matrix element of the operator A, (x) is equal to 


A, (x) =<¥ (0)| 4, IFA = [a (k) U (k) e™* d?k. 
(17.16) 


The function A,(x) contains only positive frequencies and, therefore, it 
satisfies Schrddinger’s equation 


0A, (x) 








PAOD a joys (17.17) 
l at a(x) 
where ô= +(—A?)!/?. The function A*(x) satisfies a similar equation 
oA* r 
-i o = ôA} (x). (17.17) 


However, because the operator ô is not self-conjugate, the quantity 
A*A, does not satisfy just any conservation law. 
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As we already remarked, the quantity which is conserved and is com- 
posed of real, measurable (observable) quantities, is the magnitude of 
the energy density ¢(x) = — T44 (x) which is expressed in terms of the elec- 
tric & (x) and magnetic W (x) fields. 

It is easy to show that the energy density ¢(x) is a single photon state 
is equal to 


e(x) = <P (1) ê (x) |P (1)> 
= $[8 (x) &* (x) + H (x) #*(x)], (17.18) 


where &(x) and #(x) are the matrix elements of the operators & (x) and 
HA (x) for the quantum transition ¥ (0)> ¥ (1). 

Thus in the case of a free electromagnetic field, one cannot construct 
a quantity for the probability density of finding a photon in one or an- 
other part of space at a given moment in time. However, there does exist 
an energy density that can be focused into any specific region of space. 


18. THE DIFFUSION OF RELATIVISTIC PACKETS 


In the preceding section we considered the state corresponding to a strong 
localization of quantum fields (localized packets). Such a state is not 
stationary, but changes with the passage of time in such a way that the 
degree of localization of the field decreases. An initially well defined 
wave packet diffuses so that the quantities 4x”, Ay”, 4z? increase in time. 
Characteristic features of this diffusion are common to all fields and 
therefore it is sufficient to examine the case of a scalar field. Let the wave 


packet have the formt 
k) , 
(x,t) = i gilkx— on dk (18.1) 
w 


where w = + (k? + u?)t/?. We assume 


(k — k,)? 


c(k) = N exp — zp? 


(18.2) 


where N is a normalizing factor, N = 1/(b°? (x°)1/)1/?. Because the density 


t See [30]. 
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p(x, t) is equal to 
i 0g* 0g 
yo basi — m g* — > . 
p(x, t) (o aT’ e) (18.3) 


the localization for t=0 will be strong if ọ (x, 0) or dø* (x, 0)/dt, or both 
of these quantities are strongly localized. 
Placing (18.2) into (18.1) we obtain 


k — k}? 
OED oh iv | expl “oO + ik dk 





ôt 2b? 
. in x C( — X?/20°) 
= — ile kı (a3 m mA secese (18.4) 


where a=1/b and a—>0 for b> œ, so that the state (18.1) belongs to the 
class of states which are strongly localized for t=0. 
For t>0, we have 


60 (x, t ee 
0006) _ _ ineitex-o I(x, 1), (18.5) 
ot 
N, 
Q (x, t) = — 1x o I(x, t), (18.5) 
w: 


where 


Zk 
I(x, t) = few exp| - oe 


+i(k —k,)x —i(@ — @,) ‘| (18.6) 


It is assumed that in (18.5’) that |k; |> b. 
For definiteness, let the vector k, be directed along the Ox axis so 
that k; =(k,,, 0, 0). Then 
1 2 2 2 1 kis 2 
OSU og ie? 5 eee 
(18.7) 


and q=k—k,. Simple calculations yield 


E ead 
I(x, th=A (t) exp| iz (x, t) — 2a = 22 3 (18.8) 
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where A(t) is a slowly varying quantity, « is the real phase, and the 
quantities Aj, and Aj are given by 


1 b*m* 





AQ=pt—ee, (18.9) 
1 b 
AOD tae. (18.9) 
Assuming 4? (0) = 1/b?, these equations may be rewritten in the form 
2 mê 

4i0= 20+ 25 OO (18.10) 

2 2 0 x? 242 8 
Ai(j=A 18.10’ 
i(t) = 4°(0) + -7 yE ©” ( ) 


where 4=h/p is the wavelength, p the particle momentum and v=0E/ép 
=p/E is its velocity. 

From (18.10) we can see that, for m=0, the wave-packet does not 
diffuse in the longitudinal direction as it ought to for particles whose rest 
mass is of the order m=0. In this case, de Broglie waves do not diffuse in 
vacuum so that dw/dk =c = constant. Equation (18.10) agrees with Equa- 
tion (12.19) which was derived from diffraction theory for a beam which 
was formed by a narrow slit of width 4(0)=a. 

However, there are important differences in the behavior of relativistic 
and non-relativistic beams. The width of the packet increases in pro- 
portion to 4717v? [see (12.19)]. In the non-relativistic case 7?v? =h?v?/p? 
=h?/m? =constant, and in the relativistic case 470? =h?/E? for E00 
and 72v? +0. In the non-relativistic case, the dispersion is the same in 
the longitudinal and transverse directions, 4 =4,. In the relativistic 
case an extra factor of (m/E)* appears in the dispersion in the lon- 
gitudinal direction. The factor tends to zero for E> oo and it significantly 
decreases the growth 47 (t) in time. This shows the tendency to a beha- 
vior similar to that ofa vera rest mass particle for which 47 (t)=constant. 

By rewriting (18.10) in the form 


2 2 
A? (t) = A2(0) = (2) ct, (18.10") 


where A, =h/mc, we may treat the decrease in the rate of growth 42 (t) 
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as if were due to a retarded time t’=mt/E and the motion of the system 
as if it were due to the “twin effect” — the life time of a moving packet in- 
creases by E/m. 

Thus relativistic effects have a stabilizing influence on a wave packet. 
The characteristics of relativistic packets that we have discussed are 
general for various covariant fields. This is evident from the fact that all 
of these characteristics are related not so much to the form of the am- 
plitude of the packet c(k) as to the law of dispersion for de Broglie waves 
in vacuum 

GOA De Tse, (18.11) 


dk o Sk? +m? 


which behave in an absolutely different way for k>m and k <m. 


19. THE COORDINATES OF NEWTON AND WIGNER 


As we showed earlier, in the relativistic case 6(x) cannot be the eigenfunc- 
tion of the coordinate operator for an elementary particle because it 
cannot be formed from a state with a positive energy. 

Therefore Newton and Wigner [39] introduced a new operator for 
positive particles ĝ, whose characteristics are derived from the concept 
of a localized state y. They postulated the following characteristics of the 
state which represent a particle which is localized at a moment of time t 
at the point x=y=z=0: 


(a) these states forma linear set, i.e. the superposition of two localized 
states is also a localized state; 

(b) this set transforms into itself under rotation about the origin and 
under spatial and temporal reflection; 

(c) if y is a localized state, then a spatial displacement makes it 
orthogonal to states which are localized at the origin; 

(d) they must satisfy some requirements of regularity which allow 
operators of the Lorentz group to be applied to localized states. 


It is particularly easy to find such states for spinless particles. In the 
momentum representation the wave function for a spinless particle y 
is a function of the momentum k and ky =w=(k? +m7?)"/?. In the case 
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of scalar particles, from (b) it follows that y is a function of k?, or of a, 
i.e., W=W(w). 

We now use requirement (c). The operator of spatial displacement 
T;(a) reduces to a factor e'** in the momentum representation 


p(k) = T; (a) y (w) = eY (a). (19.1) 


This state must be orthogonal to the state which is localized at the 
origin, i.e., to Y (œ). We take for ọ (k) and y (œ) the covariant amplitude 
in the form 

3 


p(x) = fow gks = (19.2) 
oO 


so that p(k) =A (k) (w)!/? and y (w) = B(w) (w)!/”. According to (16.17’) 
the scalar product of g and y has the form 


dek 
(py) = | pry — ; (19.3) 
w 
Substituting the values of ọ and y into this equation gives 
2 
| Wo ek Qk = 0, (19.4) 
w 


where a#0. From this it follows that |y (œ)|?/œ=constant, that is 
W(@) = w'!?, We may, for example, set 
1/2 


= (27)?! Š 


Y (w) (19.5) 


Applying the displacement operator to (19.5) we obtain a function 
which represents a state localized at the point x 


a a}? : 
Wx (k) = T(x) Y (œ) = On? ex, (19.6) 


It is easy to see that this function is the eigenfunction of the operator 4, 


q i( 2 + ka 19.7) 
= — || — Pai I's ž 
fz ak, 207 ( 
which is just the coordinate operator of an elementary particle according 
to Newton and Wigner. 
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Applying the operator ĝ, to (19.6) gives 
«Wx (k) = x, (k), (19.8) 


so that x is the characteristic value of the operator 4, in the state w,(k) 
which is localized at some point x at t=0. 

We now turn to the coordinate representation of a state which is 
localized at the point x=0 at the moment in time r=0. According to 
(19.2) and (19.5) we have 


` 1 ikx 5/4 
O f Saps (5) HY), (imr), (19.9) 


(27)? w !!2 


where H$}} is the Hankel function of the first kind and of the order 3. 
For r>0 @(r)xr_ >” and for r>0 (r>h/mc) @(r)xe~™. From this 
it is obvious that the region of localization of the state is defined to an 
order of magnitude by the Compton wavelength of the particle h/mc 
[in (19.9) c=1 and h=1). 

From the behavior of the function @(r) near zero it follows that the 
localized state belongs to the class of unnormalized functions. The “‘local- 
ized” state y (œ) and the new coordinate operator q, that we have con- 
sidered in this section follow naturally from the principles postulated in 
(a), (b), (c) and (d).* 

However the physical interpretation of these states and operators gy, qy, 
q- has a number of difficulties and therefore these operators are not as 
useful as the general coordinates x, y, Z. 

The first of these difficulties is contained in the fact that a state which 
is localized at t=0, x =0 in one coordinate system, will not be localized, 
in the sense of fulfilling conditions (a), (b), (c) and (d), in moving coordi- 
nate system, even if the points x=0 and x’=0 coincide at t=t'=0. In 
view of this fact the operators G,, G,, G, do not possess the characteristics 
required for relativistic covariance. 

The second difficulty is contained in the fact that it is not possible to 
construct a gedanken experiment to measure the operators 4,, q,, 4, that 
is, to single out the localized state y (x). The Newton-Wigner coordinate 
operator seems, therefore, to be the coordinate operator of the center of 


t Note that because we restricted ourselves to considering spin-zero particles, we did not 
use all the requirements listed from (a)-(d). 
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gravity of the wave packet @(p) which, by definition represents the 
localized state. 

In view of these difficulties we prefer to use the “old”? coordinates 
x, y, z and t which are measurable, at least in a gedanken experiment. 
Such an experiment is based on the assumption that it is possible to 
produce an arbitrarily narrow beam, an arbitrarily small wavelength 
%— 0, and therefore on the principal assumption that there exist arbitrar- 
ily small apertures a0. Although these assumptions are rather formal, 
they have the advantage of being definable. 


20. THE MEASUREMENT OF A MICROPARTICLE’S COORDINATES IN 
THE RELATIVISTIC CASE 


The formulation of the problem of measuring the coordinates of a 
microparticle in the relativistic case is very similar to the non-relativistic 
case that we discussed in detail in Sections 11 and 12. As in the non- 
relativistic case, we must distinguish between direct and indirect mea- 
surements. 

In the case of a direct measurement, we consider the measuring of the 
coordinates of a free particle. In Section 14 we showed that for particles 
with spin 4 which satisfy Dirac’s equation, there are allowed single- 
particle states with a positive energy which can be localized arbitrarily 
close to the point x= x’ at the moment in time t=t'. From the laws of 
diffusion of such states, it follows that this extremely localized wave 
packet ofa state will diffuse with the speed of light if it does not itself move 
with a very high speed, so that the average momentum of the particles in 
such a packet p>mc where m is the mass of the particles. 

In this case (see Section 18) the longitudinal diffusion of the packet 
ceases and the velocity of transverse diffusion decreases by a factor of 
m/E [E=(p? +m?)?!?], 

The localized single-particle state for the case of a particle with spin 
zero (which satisfies the Klein-Gordon equation) and with spin one 
(photons which satisfy Maxwell’s equations) exist only as a special ex- 
ception. 

As we showed in Sections 16 and 17, in these cases it is more reasonable 
to speak of the localization of the meson field rather than of the mesons 
or photons. The second feature of the relativistic case is the significant 
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decrease of the arbitrariness in the choice of possible interactions. The 
interactions must satisfy certain requirements of relativistic invariance. 
However these requirements do not greatly restrict gedanken experiments 
because the theory of relativity does not impose any restrictions on the 
mass of the charged particles, except for the conditions relating to the 
existence of the particles in bound states. The binding energy of the par- 
ticles has to be significantly smaller than their proper energy 


AE <me?. (20.1) 


Let us now turn to considering direct measurement of coordinates, 
which for the non-relativistic case was considered in Section 12. As in 
Section 12, we consider the atoms in a photographic plate as infinitely 
heavy (Mc* > E, where E is the energy of the particle) and we assume that 
the excitation energy of the atom ¢=g, — £ọ is small in comparison to the 
energy of the particle (E>) whose coordinates we are measuring. In 
order to write the 6-function interaction (12.2) in a relativistically 
invariant form, we consider that it is due to an exchange of very heavy 
mesons. In this case 


e7rla 
W =g 





o (20.2) 
where a is the Compton length of a hypothetically heavy meson, g is a 
coupling constant and r =((x — Q)°)"?. In the limit of an infinitely heavy 
meson a—>0, the function (20.2) is concentrated in a volume of the order 
of a0. Under the conditions discussed above, the atoms of a photo- 
graphic plate remain a non-relativistic system and relativistic consider- 
ations extend only to the description of the motion of the free particles 
whose coordinate is defined by observing the consequences of excitation 
of the atoms of the photographic plate, and its interaction with these 
atoms. 

In the Born approximation, which we used in Section 12, relativity 
does not change the main character of the conclusions which were drawn 
in Section 12 regarding the physical processes involved in such measure- 
ments. In particular, replacing the 6-function interaction (12.2) by the in- 
teraction of ultra-heavy mesons (20.2) and replacing the Schrédinger 
wave function by a four-component spinor which satisfies Dirac’s 
equations does not change the conclusion that, for small intervals of time 
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[see (12.8)], the excitation of an atom is accompanied by the localization 
of a microparticle in the vicinity of the atom. This result also holds true 
for mesons which satisfy Klein’s wave equation. 

We now turn to indirect measurements. In this case we mean coordi- 
nate measurements of particles that are in bound states using observations 
of scattering of a narrow beam-probe. We first note that, insofar as we 
assume that bound microparticles retain their individuality, we have 
predetermined the smallness of the mass defect AE in comparison to 
the characteristic energy of the particle mc*. Therefore, relativistic 
effects will not affect the basic conclusion drawn in Section 12 from con- 
siderations of non-relativistic indirect measurements. A more important 
fact is that the narrow beam, which is necessary for finding the location 
of a bound microparticle in the relativistic case, due to the reasons 
discussed in Section 18 can be made significantly narrower, and to havea 
much smaller aperture angle than that allowed in the non-relativistic 
packet. 

In concluding this discussion we shall make a few comments re- 
garding the possibility of obtaining such narrow beam-probes. We are, 
of course, not discussing their practical realization but only the theoretical 
possibility of this. We may consider that the necessary narrow aperture 
whose width is a>z (for 40) may be made from a slit in a strongly 
absorbing material. If the necessary absorption is due to the production 
of new, very heavy and correspondingly slow particles, then their gener- 
ation will not “spoil” our beam-probe. The existence of apertures with 
such properties does not contradict the principles of quantum theory of 
fields and, therefore, they may serve as a formal basis for discussion of 
extremely narrow beams and small wave packets (see Appendix IV). 


CHAPTER IV 


THE ROLE OF FINITE DIMENSIONS OF 
ELEMENTARY PARTICLES 


21. THE POLARIZATION OF VACUUM. THE DIMENSIONS 
OF AN ELECTRON 


When analyzing the possibility of measuring the space-time coordinates 
of an elementary particle, they were assumed to be point particles, and 
thus the accuracy to which they could be determined was limited by the 
possibility of producing an arbitrarily narrow beam-probe which inter- 
acts by contact, interaction with the microparticles whose location in 
spacetime is thus measured. The latter requirement is necessary for the 
elimination of the effects related to the existence of a finite, or even infinite 
radius of interaction. 

As shown before, this possibility is allowed in the form of a gedanken 
experiment. However, it also follows that the physical picture of the 
point particle is actually an approximation. In higher approximations in 
the powers of the interaction constant, effects arise which may be 
considered as the manifestation of the structure of the particle which is 
dynamic in nature and cannot be reduced to a distribution of some 
material in space. 

In our discussion of the structure we will begin with the simple case of 
an infinitely heavy, electrically charged point particle surrounded by a 
Coulomb field, the potential of which is equal to 


Vi) = £, (21.1) 


where e is the charge of the particle and r is the distance from the particle 
to the point of observation. 

The interaction of two charged particles, where one is scattered on the 
other, may be represented by a Feynman diagram (see Figure 15a). M 
denotes the heavy point particle, line 1-2 the path of the scattered particle, 
and the dotted line q the path of the virtual photon which transfers mo- 
mentum from one particle to the other. In addition to this simple diagram, 
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there are other diagrams which give a contribution to this process. One 
of these is shown in Figure 15b. This diagram describes the production 
of a positron electron pair, the e*, e~ loop, by a virtual photon q and 
the annihilation of this pair in the subsequent emission of a photon q 
which in turn causes the momentum exchange between the particles. 


2 


2 
et 
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M M 
= 
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(a) (b) 


Fig. 15. (a) An illustration of the scattering of a charged particle (line 1-2) on an in- 

finitely heavy (fixed) charged particle M. q is a virtual photon. (b) The polarization of 

vacuum is taken into account in this diagram. It includes the corrections to Coulomb’s law 

that are needed to account for the polarization of vacuum. q and q’ are virtual photons; 
e*t, e” area positron and electron of a virtual pair. 


This rather complicated diagram takes into account in the first non- 
vanishing approximation the effect of the polarization of the electron- 
positron field by the Coulomb field of the massive particle M. The compu- 
tation of this effect is very tedious and we refer the reader to books on 
quantum electrodynamics for the necessary details. 

The calculations show that the taking into account the polarization 
correction leads to a new expression for the potential of a point charge. 
In place of (21.1) we have 


Vin= ai (21.2) 


where the quantity e(r) is called the effective charge. From the calcula- 
tions the effective charge is equal to 


t See, e.g., Chapter VIII in [37], Chapter VI in [38]. 
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2 
fie Zmn(4) +=} for r & A, (21.3) 


3n r 


e(r) = 2 7 \32 
efi +teml(Z) + for r > A. (21.3) 
nt A 


where a=e?/hc, A=h/mc, where m is the mass of the electron. From 
these equations we see that, for small r, the effective charge is large. In 
proportion to the increase in the spacing r, this charge is screened by the 
vacuum polarization more and more strongly, so that for r>A it 
becomes equal to the charge of the particle e that is macroscopically 
observable.t From Equation (21.3’) we also see that the effective charge 
is distributed in a region whose dimensions are of the order of magnitude 
of the Compton length of an electron (h/mc=386 x 107+! cm). The 
existence of a vacuum polarization leads to the important possibility of 
the scattering of light off an infinitely heavy charged particle. 

It is well known that, Maxwell’s equations being linear, the static 
field and wave field may be superimposed on each other without dis- 
tortion. The vacuum polarization destroys this linearity. In the first non- 
vanishing approximation of the interaction of a quantum of light with 
the field of a heavy particle, the polarization of vacuum may be described 
by the diagram shown in Figure 16. The lines 1 and 2 show as before the 
incoming and outgoing paths of the scattered photon. The loop e* 
illustrates the formation of virtual positron-electron pairs which are 
produced by the vacuum polarization. The lines q, q’ are the paths of the 
virtual photons which transfer momentum to the heavy particle M. 

We must, therefore, keep in mind that, when a photon with a frequency 
«> 2mc?/h interacts with a heavy charged particle (e.g., with an atomic 
nucleus), the dominant effect will be the positron-electron pair forma- 
tion (e*e~) and, therefore, absorption of the photon. This absorption 
results in the elastic diffraction scattering of photons. The total elastic 
scattering cross section c. consists of the elastic diffraction scattering 
cross section o,, and the elastic scattering cross section that is related 
to the vacuum polarization o% and which is defined by the process shown 
in Figure 16. 


t We must remember, however, that Equations (21.3) and (21.3') are approximations and 
valid only as long as the corrections |e(r)—e| are small in comparison to e. 


98 SPACE AND TIME IN THE MICROWORLD 





Fig. 16. An illustration of the scattering of light on a fixed charged particle M. The 

scattering actually takes place on the electrons and positrons (e*) that result from the 

polarization of vacuum by the particle M. Here, as in Figure 15, q and q’ are virtual photons 
and e* are a virtual pair. 


Because the cross section for production of a ø, pair is compara- 
tively large, the ratio of o% and g% is small: 


oa _ 3 eaj in #2 (21.4) 





oa 7 \hao mc? 
(see [37], Section 55). 

Note that for frequencies @<2mc? for which real pairs of electrons 
and positions cannot be produced, the effect of vacuum polarization may 
be described in terms of the dielectric constant tensor for vacuum €; and 
the magnetic permeability tensor ju; as 


1 

Eik = Onn + g2 {2 (6? + H’) On + THH h ’ (21.5) 
0 
1 

Hik = Ôik + z2 {2 (8? = H?) Òx — THH } : (21.5’) 
0 


where &, W are the electric and magnetic field intensities, £; and #; 
(i= 1, 2, 3) are the projections of these vectors on the coordinate axes and 
ĉo is some critical field which is defined to an order of magnitude by the 
equation 


h 
efo — = 2mc’, (21.6) 
me 
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implying that the potential difference in such a field over a separation h/mc 
is equal to the energy required for pair formation. In this manner, as a 
result of the polarization of space around a charged particle, a charged 
“atmosphere” is formed around the particle, the extent of which is defined 
by the electron Compton wavelength h/mc. Because of this fact, particles 
lose their point nature. 

In particular, if we take the particle as being localized in a very narrow 
wave packet whose dimensions are smaller than h/mc, then this situation 
will correspond to the case of the stationary, heavy particles that we 
considered above. A charged region of the order of h/mc is formed 
around these particles and because of this the interaction of these particles 
with the particles of the probe can take place. And, therefore, an un- 
certainty which cannot be removed arises in the definition of the location 
of the particle, which is equal in order of magnitude to the quantity 


neat zy 
Ax PA 3.86 x 1071! cm. (21.7) 

In the case of electromagnetic interactions, this atmosphere is very 
tenuous. This follows from the smallness of the electromagnetic inter- 
action constant «=e?/hc =1/137. Therefore, the number of interactions 
in the atmosphere of particles in regions of polarized vacuum will be very 
small in comparison to the number of point interactions. In the over- 
whelming majority of cases the uncertainty in definition of particle co- 
ordinates will not be related to the dimensions of the particle atmo- 
spheres. 

In particular, corrections to electron scattering, to the Compton 
effect, to bremsstrahlung and to other electromagnetic effects that result 
from the vacuum polarization in the vicinity of an electron (or positron) 
are very small (see [37]). 


22. THE ELECTROMAGNETIC STRUCTURE OF NUCLEONS 


An entirely different situation exists for the cases in which the dimensions 
of the atmospheres of the particles are formed from strong interactions. 
Because of the significant magnitude of the strong interaction constant 
g (g?/4nhc=16) the atmosphere that forms around strongly inter- 
acting particles is not, under any circumstance, rare. 
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This assertion may be illustrated using the electromagnetic “form 
factor” t which define the charge and current distributions in nucleons. 
The Feynman diagram shown in Figure 17a represents the scattering 
process on a proton in which the electron is considered as a point. Lines 
3 and 4 represent the path of the scattered electron; lines 1 and 2 represent 
the proton path; the vertex I which is indicated by the shaded triangle 
emphasizes the extended character of the nucleon, and q is the path of 
the virtual photon which transfers momentum between the proton and 
electron. The diagram in Figure 17b illustrates the origin of the extended 
nucleon. At the point A a proton emits a 2 *-meson by means of a strong 
pion-nucleon interaction. The z*-meson at point B emits (or absorbs) a 


2 





1 
(a) (b) 


Fig. 17. (a) The heavy line 1-2 is the proton path; q is the virtual photon and the line 

3-4 ıs the path of the electron being scattered. The vertex F is the region around the proton 

in which the virtual photon is emitted or absorbed. (b) The structure of the vertex I is shown 

in this figure to a first approximation. At the vertex A, a n*-meson is emitted which then 

emits a virtual photon q at the vertex B. At the vertex C, a 2*-meson is absorbed by a 
virtual neutron (line AC) which then turns into the scattered proton (2). 


virtual quantum q, and finally at the point C this meson is absorbed by a 
neutron which is transformed into a scattered proton (line 2). By this 
process a pion atmosphere is formed around the proton and this also 
creates the effect of an extended nucleon. The dimensions of this at- 
mosphere is defined by the meson Compton wavelength h/mc=1.4 
x 107*3 cm. 


t This term is borrowed from X-ray scattering theory. The “form factor” takes into ac- 
count the point character of charge distributions and currents inside atoms in this theory. 
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The diagram we considered above as an illustration of how an extended 
nucleon is formed has a conditional meaning. Due to the rather large 
value of the interaction constants of the pion and the nucleon field, the 
contribution of the other diagrams that describe processes with a large 
number of virtual mesons cannot be small. There are also significant 
resonances in the interactions of these mesons with each other. Therefore, 
the computation of these effects, due to the existence of the meson at- 
mosphere of the nucleon, present great and as yet unsurmountable 
difficulties. 

However, it is possible to construct a phenomenological theory of 
a nucleon’s meson atmosphere in terms of the form factor. In order to do 
this, we compute the matrix element of the vertex particles T, (x2, x1, Y), 
and we do not restrict ourselves to diagrams of the type shown in Figure 
17b, because in the case of strong interactions such limitations are not 
justified.* 

The quantity I,,(x2, x1, y) represents a nucleon current with all inter- 
action effects in vacuum taken into account. It is more convenient to 
compute the matrix element of the interaction energy of this current with 
the field of a virtual photon, which we write in terms of the vector po- 
tential 


A,(y) = aq (Q) e. (22.1) 


This energy of interaction is equal to 


W (x2 — xı) = fr. (x2, X1, Y) Ay (y) d*y. (22.2) 


We note that, because of the homogeneity of the vacuum, the quantity 
I, the translational invariant, correspondingly depends only on the 
difference of the coordinates x,, x,, y. Therefore, in place of the coordi- 
nates, it is more convenient to use the quantities x=x,—x,, and X=y 
—4(x, + x2). Therefore, 


T, (x2, Xis yal (xX, x) 


eo o fe i(aX+Bx) 44. 44 22 
-gy [nepe d*a d*B, (22.3) 


t In Appendix VA, the calculations of the vertex I’,(x2, x;, y) are explained on the basis 
of perturbation theory (diagram in Figure 17b). This simple calculation is suited for the 
case of weak coupling between nucleons and mesons. 
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so that I", (a, $) is the Fourier transform of T,(X, x). 
Conservation of current, which is equivalent to the gauge invariance, 
requires that 


ôr, oO, 

— = = 0 

ay. OX, (22.4) 
or 

Č u(% B) a, = 0. (22.4’) 


We now calculate the energy W(x, — x) matrix element for a transition 
of the nucleon from state with momentum p, =p; to a state with momen- 
tum p,=p,. These states are described by the spinors 


Vr (x2) = Us (pe) ePi yi (x) = u (pi) e73. (22.5) 
We have 


(IWI ò) = fro W (x2 — x1) pi (x1) dfx, d*xy. (22.6) 


Placing W (x, -—x,) from (22.2) into this equation and using (22.3) we 
obtain 


(f IWI i) = (27) ôt (py — pi +q) ts (pp) Ža, P) u;(p;) a, (a). 
(22.7) 


Therefore, 
(fli) = (2)* 64 (p; — pi — q) ñs (p) P(g, P) u; (p), (22.8) 


where «=q=p,—p; is the momentum transferred and B=P=p, +p ; is 
the sum of the initial and final momenta. 
We now define the structure of the current vector 


u = ū,(p;) l(a P) u; (p;). (22.9) 


Note that in constructing this vector we used the Dirac matrices y,, 6,, 
and the vectors g=p,—p;, P=p,;+p;, the Dirac matrices y, and ysy, 
being pseudovectors. Therefore, the current is of the form 


J, = tie (pp) {aq, + bP, + cy, + dO gygy + eoyyP,} ti (pi), 
(22.10) 


FINITE DIMENSIONS OF ELEMENTARY PARTICLES 103 


where the coefficients a, b, c, d and e are scalar. Using Dirac’s equations 
for the spinors u,(p,) and u;(p;) and the additional condition (22.4’) 
which may now be written as 


Ja, = 9, (22.4") 
we may simplify (22.10) and reduce it to the form 


J, = üş (pp) {B(Q?) yu + C (0°) Ouray} ui (Pi) (22.11) 


(see Appendix V.B). Here B(q°) and C(q°) are the only form factors 
which remain after the simplification of (22.10). For point particles with 
charge e and anomalous magnetic moment p we have B(0)=e, C(O) =i. 
Therefore, we use the following normalization for form factors 


B(q’) = eF, (q’), (22.12) 
C(q’) = inF,(q’), (22.13) 
F, (0) = F,(0) = 1. (22.14) 


Under this normalization, the nucleon current takes on the form 


J, = tty (pp) {eF; (q?) Ya + iF, (4°) Ouvdy} u; (pi). (22.15) 


We shall consider the exchange of momentum in the center-of-mass 
system of the particle. For the case of elastic scattering of electrons on 
protons, this will be the system of the center of mass of the electron 
and proton. The magnitude of the momentum of the particles remains 
constant in this system, and only the directions change so that |p;|= |p;l, 
and correspondingly the fourth component of the vector q is equal to zero 
[q4=(q, 0), q=q,—q;, 7 =q’]. In this coordinate system we may define 
the spatial image of the nucleon; that is the spatial form for the distribu- 
tion of the electric charge density p(X) will be 


d3q 
p(X) =e fr (qô) exp (iqX) On) (22.16) 


and the magnetic moment distribution u (X) will be 
d3q 


Qn (22.16’) 


u(X) =u | F, (q”) exp (iqX) 


From these equations it follows that 
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| p(X) d?X = eF, (0) =e, (22.17) 


faw d3x = uF, (0) =u. (22.17) 


From (22.16) and (22.16’) we may define the rms of the electric radius 
<r?>, and the rms of the magnetic radius < r?°), by means of the equations 


a 1t 2 d? -5 
<r » =} for OX => ia) (22.18) 
2 -+| 2 43 sE) ; 
<r =; u(X) 7? BPX => it), (22.18’) 


The second equation is obtained by using the Fourier expansions (22.17) 
and (22.17’) differentiated with respect to q?. 

From Equations (22.16) and (22.16’) it follows that the form factor 
F, (q?) defines the diffusion in space of the electric charge of the nucleon, 
and the form factor F, (q”) defines the diffusion of its magnetic moment. 

Thus the form factors F, and F, reflect the space-time structure of 
nucleons. This structure was first discovered experimentally during 
studies of electron scattering in hydrogen and deuterium by Hofstadter 
[40-42]. His basic result shows that 


Fi, (4°) = Fap (0°), (22.19) 

Fyn (g0, Fon (q7) © Fip (q’), (22.19’) 
where we denote the form factor for protons by p, and that for neutrons by 
n. Figure 18 shows the recent results of the electromagnetic form factor 
for nucleons [43]. The electric radii prove to be the same: for protons 
r,=(<r2),)!/? =0.8 x 10713 cm and for neutrons r,=0. The magnetic 
radii r=(<r7>,)"/? are equal to r,. 

The spatial structure of nucleons which is defined by Equations (22.16) 
and (22.16’) has a somewhat arbitrary character. This arbitrariness 
arises from the fact that, from the point of view of quantum field theory, 
the electromagnetic structure of the nucleons is described by the vertex 
function I’, (X, x) which is a function of two variables X= y —4(x, +2), 
x=x,— x, and not by one variable. Furthermore, this structure is de- 
scribed as dynamic, taking into account the nucleon motion in its inter- 
action with the virtual photons. 
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Fig. 18. This graph represents the form factors F, and F, as functions of the square of 
the transferred momentum q?. Within experimental error, F, and F, are the same. 


In Appendix V.A a similar coupling is considered, between the vertex 
T (X, x) and densities p (X), p(X), which are defined by Equations (22.16) 
and (22.16’). We also show there that these equations have a direct 
physical meaning for heavy nucleons and for small momentum transfers 
q? <4M? (M is the nucleon mass). In this case the density p(X) may 
be written in the form 


p, (X) = | F,(q, 2M) e d3q, (22.20) 


in which the assumption q? <4M? is accounted for in an obvious way. 

We now return to our main theme, to the problem of measuring the 
location of a nucleon. Because the dimensions of the pion atmosphere of 
the nucleon are finite even if particles in the beam-probe were localized 
exactly, one cannot define the location of nucleon by scattering a probe 
particle off of it, more accurately than to within the limits of its 
dimensions 4x ~r,. In the case we considered of electromagnetic inter- 
actions, the radius of interaction, strictly speaking, is infinitely large. 
This property of the electromagnetic (Coulomb and magnetic) interac- 
tion is in itself a serious obstacle to the defining of the nucleon location in 
the gedanken experiment illustrated in Figure 14. However, this fact is 
not all important. Instead of the electromagnetic interaction we may 
consider a weak interaction which is a contact, and point interaction. As 
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we shall now show however, the same form factors F; (q?) and F;(q’), 
which characterize the electromagnetic interaction, enter into this inter- 
action. 

For definitness, we shall consider the process that is the reverse of the 
process in which a proton p captures a y-meson namely, 


n+vopt+y. (22.21) 


where n is a neutron and v is a neutrino. The initial particles are neutral 
and therefore the effects which are related to the electromagnetic inter- 
action are absent. 
The interaction which describes this process is a contact one and has 
the form [44] 
G 
W= Z Sy, (22.22) 


where Gp is the weak interaction constant. Gp= grlhc(me/h)*, m is the 
nucleon mass and gp=2 x 10~*° erg cm? is the Fermi constant. This 
constant may be expressed in terms of the length Ap=(g,/hc)'/” =0.6 x 
x10716 cm, which is characteristic of weak interactions. J} is the nucleon 
“strong” current given by 


Ji = Ọna" Un (u= 1,2, 3, 4). (22.23) 


where Jy, Wx are the nucleon wave functions (proton or neutron). The 
operator J, for point nucleons has the form 


J, = ey, + ipo pyy. (22.24) 


t* is the operator of the isotopic spin which transforms a neutron into 
a proton (i.e., it changes the symbol N=n to N=p). 
The current J} is the “weak” current and it has the form 


Jy = $,0,9,, (22.25) 
where @, is the neutrino wave function, @, is the y-meson wave function, 
and Ô, is the operator that is composed of the y, and ys matrices (for 
details see [44]). 

The existence of the operator t* =4(t, +it2) (for reverse transition 
the operator t~ =4(t, —it,) would be used) shows that J; consists of a 
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linear combination of the components of the isotopic current vector 
pis Jaz Ja3: 
Jas = ORAN > (22.23) 


wherec = 1, 2, 3. However, the components of the current J}, correspond 
exactly to the electromagnetic current and therefore, the operator J, 
[see (22.24)] has to contain the same form factors F, (q”) and F, (4°) which 
are characteristic of the nucleon [see (22.19) and (22.19’)]. In other 
words, the current J; accounts for the charged pion atmosphere of the 
nucleon. Because of this, the contact character of the interaction (22.22) 
is violated and it becomes non-local. 

The Feynman diagrams describing this contact interaction (22.22) 
and the interaction taking into account the finite nucleon dimensions, are 
shown in Figure 19a and 19b. Due to the formation of a space-time 
distribution of nucleon charge and currents at the vertex I’, even in a 





(a) (b) 


Fig. 19. Diagrams which describe a weak interaction (the process n+v—>p+ p7). 
(a) Diagram for a contact interaction. (b) Diagram in which the finite dimensions of the 
nucleon are taken into account (the form factor). 


gedanken experiment which allows the existence of an arbitrarily narrow 
packet of neutrinos for a probe to define the location of a nucleon, it is 
impossible to define the location of the scattered neutrino more accurately 
within the limits of the nucleon dimensions, i.e., in a region Ax~r, 
(or r,). Thus we can see that the existence of form factors of elementary 
particles which relect their space-time structure leads to the fact that it 
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is impossible to localize a particle in space-time more accurately than to 
within the limits of its dimensions, i.e., than to the accuracy 


Ax ~ fo, At ~ ro/c, (22.26) 


where rọ is the particle radius. 


23. THE MESON STRUCTURE OF NUCLEONS 


In this section we turn to the theory of elastic scattering of pions on 
nucleons in high-energy regions (higher than the threshold for meson 
production). In this case non-elastic processes play the determining role 
so that elastic scattering appears as diffraction. Consequently the scat- 
tering amplitude T(E}, q) will be imaginary [E, is the meson energy in 
the center of mass system, E=(m?+k?)!/? and q is the momentum 
transfer, |q|=2k sin 9/2, 9 is the scattering angle] and may be expressed 
directly in terms of differential scattering cross section 


do 
do IT(E,, QI? = — T(E, Q). (23.1) 


Thus we may obtain the amplitude 7(£,, q) from experimental data 
for do/dQ. On the other hand, this amplitude may be written as a series 
of Legendre polynomials P, (cos 9) 


1 ; 
T (E,,q) = a), (21 + 1) (1 — e?'") P,(cos9), (23.2) 
i 
1=0 

where 

milk) = ô (k) + iyik), 120, ... (23.2’) 
is the complex phase. For sufficiently short waves (4 <a, where a is the 
dimension of the scattered wave system) the sum over / may be replaced 
by an integral by assuming /A/— b db/ż? =k?°b db, where b is the impact 
parameter. Furthermore, for large /and small 9 the polynomial P, (cos 9) 
may be replaced by the Bessel function 


Jo (isin 9) ~% Jo (bq). 
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Then (23.2) may be rewritten as 


fe} 


k S 

T (E,,4) = > f b db[1 — e7?™® 7] J (bq). (23.3) 
o 

Because the Bessel function Jo (bq) is orthogonal, this integral may be 

inverted to give 


co 


r 2i 
1 — e?"0 P = af T (E,, q) Jo (bq) q dq. (23.4) 
0 

In this manner, the phase n (b, k) may be defined from experimental 
values as a function of the meson energy k =(E? —m?)"’? and the impact 
parameter b. On the other hand, from the limits of geometrical optics, 
the phase changes of the initial waves 2y (b, k) may be calculated by inte- 

grating over the path of the index of refraction n, with the result 

B 


2 (b, k) = | n(k, x) ds. (23.5) 


Figure 20 shows that integration path inside a nucleon. 



























































EET 


(a) (b) 


Fig. 20. (a) The sphere represents the nucleon. AB is the path of a beam inside of the 

nucleon; b is the impact parameter. (b) A partially absorbing screen which is equivalent 

to the nucleon. This screen gives the same diffraction pattern (for small scattering angles) 
as the nucleon. 
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Analyses of the experimental data for pion scattering on nucleons 
(x* p) were carried out in [42, 45]. The results of this analysis is shown 
in Figure 21, where the absorption coefficient K(r) is plotted as the 
ordinate. The imaginary part of the refraction coefficient n (its real 
part is extremely small in the case being considered) is plotted on the 
abscissa as a function of distance from the nucleon center r in the center 
of mass system. The absorption decreases with distance from the center of 
the nucleon, and it practically vanishes for values of r greater than 


kir) 10 em" 





\ 
\ 
\ 
\ 
\ 
\ 
\ 
\ 
\ 


0 02 04 O06 O08 10 12 14 16 r, fermi 


Fig. 21. The continuous curve is the pion absorption coefficient K (r) as a function of the 
distance from the center of the nucleon r for the pion energy E= 1.3 GeV. The dotted curve 
is the same function for E=5 GeV. 


r=0.7 x 10°13 cm. The agreement of this value with data on the electro- 
magnetic radius of a nucleon (r,=r,=0.8 x 107+? cm) shows that the 
absorption of mesons by nucleons and the distribution of currents and 


charges in a nucleon are caused by one and the same physical cause, the 
meson atmosphere of the nucleon. 


FINITE DIMENSIONS OF ELEMENTARY PARTICLES 111 


To what extent can we justify this simple optical picture of the meson- 
nucleon interaction? Present-day field theory does not give any direct 
method for calculating strong interactions, to which class belong the 
interactions of z-mesons with nucleons. Therefore, we can only give 
the conventional reasons which are characteristic of the acceptance of 
current theory. These are the assumptions (i) that the interaction 
constant g is small (this condition is necessary to ensure the convergence 
of the series), (ii) that it is possible to normalize all of the approxima- 
tions in the powers of g, (iii) that the final equations obtained may be 
used without any limitations on the value of the constant g. 

Following this line of reasoning, we obtain an equation which has a 
completely defined structure but which, for practical use, requires 
further assumptions [46]. As is shown in Appendix VI this equation 
has the form 


(E — R’ yx) = | U(x, x; E) W(x’) d°x’, (23.6) 


where K?= -—V? +m? (m is the meson mass), E=E,+ Ey, Ey is the 
nucleon energy in this system, U(x, x’; E) is, generally speaking, the non- 
localized complex potential which acts from the direction of the nucleon 
on the meson; w(x) is the wave function for relative motion of the 
nucleon and meson, measured simultaneously so that x =x. —X,,t=t, — 
—1,=0. 

Another equivalent equation is 


(Ex — FR) W(x) = | V (x, x’; E) w(x’) dex’, (23.7) 


where #8 is the Dirac operator #2=ap+BMc?, and M is the nucleon 
mass while V(x, x’; E) is the potential due to the meson which acts on 
the nucleon. 

The relationship between U(x’, x; E) and V(x, x’; E) is also given in 
Appendix VI. The incompleteness of the theory, which underlies Equa- 


t A similar method is used in the dispersion relation theory. The scattering matrices, which 
are required for establishing the analytical properties of the scattering amplitude, are con- 
structed using perturbation theory and assuming the interaction to be small. The same dis- 
persion relation is used in the region of strong interactions for all values of the interaction 
constant. 
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tions (23.6) and (23.7), is expressed in that these nonlocal and complex 
potentials can be calculated explicitly only in the form of a series in 
powers of the interaction constant g. If we remove the assumption that 
the constant g be small, we must make certain assumptions about the form 
of U or V. A similar situation exists in the nuclear theory in which the 
complicated nucleon interaction is replaced by a potential well which 
describes the interaction in an average manner. 

If the wavelength of relative particle motion is significantly greater than 
the dimensions of the region in which the non-localized potentials U and 
V differ significantly from zero, then Equations (23.6) and (23.7) reduce 
to the simplified form 





(E? — R?) W(x) = U(x, E) W(x), (23.6) 
(Ex — #2) W(x) = V(x, E) y(x), (23.7') 
where 
U(x, E) = | U (x, x’; E) m d?x', (23.8) 
z 1 VX) a3, 
væ p= [vax neg x (23.8') 


and the quantity W (x’)/w (x) under the integral is replaced by unity. Equa- 
tions (23.6’) and (23.7’) have the form of Schrédinger’s or Dirac’s equa- 
tions with a complex potential U(x, E) or V(x, E) that is a function of 
energy. 

Equations (23.6) and (23.7) also simplify in another limiting case, for 
A<a. Note that, because of a relationship which is valid for short waves 


o = na’ (1 — P), (23.9) 


where ø is the elastic scattering cross section and $ (0<f <1), the trans- 
missivity of the nucleon may be written in the form 


a o 12:1 
GPA ae (23.9’) 


In order to satisfy this condition, we write the wave function in a form 
characteristic of geometrical optics 


w(x) = exp {ikS(x)}, (23.10) 
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where S(x) is the action function and k=1/A. Substituting (23.10) into 
(23.6'), we collect together terms according to the order of k? (we assume 
that k? 00). Equating the k?-order terms gives the eikonal equation 


(VSP =n’, (23.11) 


where n is the complex index of refraction that satisfies the equation 
1 
n -l= ale (x, x’; E) exp ik {S (x^) — S(x)} d3x’. (23.12) 


Expanding the difference S (x’)—.S(x) and the index of the exponential 
under the integral in powers of (x’—x) gives 


S(x’) — S(x) = VS (x) (x — x’) +---=npcosO+---, (23.13) 


where p=x—x’. Then Equation (23.12) becomes 


n? -l= af U (x, x + p, E) elt? 08 g3p (23.14) 
If we replace the index of refraction n on the right-hand side by n= 1, we 
obtain the first approximation for n?—1. This result may be refined by 
iteration. By assuming n=«+ if, it is easy to show that a necessary con- 
dition for convergence of the iteration is that kB—constant for k> œ. 

On the basis of the optical theorem, according to which the imaginary 
part of the amplitude 7 (E£,, q) of forward elastic scattering (i.e., for g=0) 
is related to the total cross section or by the equation? 


Im T(E, 0) = =o (23.15) 


where E,=(m?+k?)'/?, and using the experimental fact that the total 
cross section of zN interaction does not increase with energy £, but tends 
to a constant value, it is possible to show that the quantity k, decreases 
as k— œ, and that thus the condition necessary for convergence in (23.14) 
is ensured [47]. 

This optical approach to describing the spatial structure of a meson 
atmosphere of a nucleon is only one of the many ways of studying this 
structure [48]. The qualitative and quantitative conclusion that we ob- 


t This theorem is a result of the unitary character of the scattering matrix; see Appendix 
XIL4. 
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tained using an optical model of the nucleon are in agreement with the 
basic quantum theory of meson fields. 


24. THE STRUCTURE OF PARTICLES IN 
QUANTIZED FIELD THEORY 


We shall study the structure of elementary particles using elastic scatter- 
ing of probe-particles. In modern theory, there are three types of inter- 
actions to be considered: strong, electromagnetic, and weak. Thus we 
have to consider three types of elementary particle structure that corre- 
spond to the three types of interaction. 

The concept of elementary particle structure differs from the classical 
concept of the structure of an object. The reason for this is based on the 
following fact. From experimental data on elastic particle scattering we 
construct the scattering amplitude 7,; which corresponds to the transi- 
tion of the scattered particles from some initial state i to a final state f. 
Reconstructing this amplitude from experimental data in itself presents 
a serious problem [49], and is known under the name of complete 
experiment. 

The amplitude 7;, is a function of two invariants, s= P? (P is the total 
momentum of the scattered particle) and t= — q7, where q is the momen- 
tum that is transfered and is equal to the difference between the final and 
initial momenta of one of the particles; g=p,— pi, i.e., 


Tp = A(t, s). (24.1) 


The information contained in the amplitude relates to both states of 
the particle being studied, i and f, which are characterized by the momenta 
pi and pp. Thus this information is presented to us in the form of inter- 
ference between two structures which are related in the classical sense 
to two different states of motion of the particle being studied. On the basis 
of the well-known “optical” interpretation, we may count on obtaining 
information on the space-time structure of an object in a scale of b only 
in the case that the event are considered as acts of scattering in which 
a momentum g2h/b is transferred. We may consider the quantity b as 
the impact parameter. The duration of the encounter will be defined by a 
time of the order of t=6/v where v is the velocity of the particle-probe 
which is close to the velocity of light. 
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In a time t the particle being studied travels a distance of the order of 
[= (b/v) (p/m), where p is its acquired momentum and M its mass. In 
order that the “structure” will be related to the initial state of the particle- 
object, /<b. It is easy to see that this is equivalent to the requirement that 
(h/M) (p/q) <6, and because the momenta p and q are of the same order 
of magnitude, we obtain 


b> h/Mv, (24.2) 


i.e., we may study the classical structure of a particle only on scales of b 
that are significantly greater than the Compton wavelength of the particle 
being studied. This qualitative estimate is in agreement with the better- 
grounded estimate given in Appendix V in connection with the electro- 
magnetic description of nucleon structure discussed in Section 22. 

Secondly, to the degree to which the energy of the particle-probe in- 
creases and correspondingly to the degree to which the upper limit of 
the momentum transfer |q| max= 2p increases (where p is the relative mo- 
mentum of the particle-probe and the particle-object), the relative role 
played by inelastic processes increases and the picture of the object that 
is observed by scattering particles acquires a diffraction character from 
the strongly absorbing body. As noted in [45], if in some region b< R the 
absorption becomes total, so that this region may be considered as a black 
body, all information about the structure of this region is destroyed.t 
The quantity R is called the “elementary length” that depends upon the 
dynamics of inelastic processes. 

Therefore the behavior of the ratio between elastic scattering pro- 
cesses c, and total cross section oy for an unlimited increase in energy 
(so) 


lim —! = 2 (24.3) 

OT 
may have fundamental meaning in future theory. The question of the 
behavior of cross sections of inelastic processes ¢;, is also important. If 
Oin > 0 for s> œ, this would indicate that the elementary particle becomes 
transparent at extremely high energies, and “blackness” would not exist. 
If lim o;,>0, then, in elementary particle encounters, a macroscopic 


t We note that for various interactions this state can arise (if it can in general) when the 
energies and scales are significantly different. 
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quantity of material could arise because of their high energies, with no 
limitations on the amount. In this case elementary processes would be 
able to generate macroscopic processes and a general field of activity in 
the micro- and macroworld would be discovered [51]. However 
“revenons à nos moutons” t In order to investigate elastic scattering at 
high energies more deeply, let us assume that we are able to sum up all 
the higher Feynman diagrams which contribute to the elastic scattering 
amplitude. Figure 22 shows such diagrams for a field œ which describes 


Y2 n Yè n Y2 


Nn Y2 n Y2 pA] 
z = K + + + fot + 
K a na A w 


xX; X: xX. x x. 
2 1 (c) 2 1 (d) 2 


Fig. 22. The left-hand side is the function s(y,, Y2 | X1, x2) and the right-hand side is its 

expansion in terms of Feynman diagrams. (a) is the diagram for the first order in g; (b) is one 

of the diagrams for the second order (g”); (c) is one of the third order (g>); (d) is one of the 
fourth order (g*), etc. 


uncharged, scalar particles. The energy of interaction in such a field may 
be written as 


W = 1964, (24.4) 


where g is some interaction constant. 

Summing such diagrams, which are in the coordinate representation 
and have two incoming lines (x, and x.) and two outgoing lines (y; and 
Y2), leads to the four-parameter function 


S = 5 (X4, X3, X2, X1), (24.5) 


which describes the particle scattering process (two particles initially and 
two finally). 

In Appendix VII we show that the scattering matrix § which describes 
all possible processes with scalar, uncharged particles, may be expressed 


t An old french proverb: “Let us return to our sheep”. 
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as a set of scalar functions 


s(x), 
s (x1, X2), 


§ = s(x > X25 X3) > (24.6) 


S (Xis X25 -++) Xn) 


Each of these functions is the vacuum average of all possible processes of 
particle interactions which are accompanied by particle production and 
annihilation. 

Because vacuum is homogeneous, these functions are functions only 
of the coordinate difference x =x;— Xp, in particular s(x,)=constant. 
Furthermore, in this case of a scalar field, these functions depend only 
on the scalar product x;,x,,, and hence equally on all x;. 

The matrix elements of the scattering matrix which describes the transi- 
tion from some initial state y;(x,, X2,..., Xp) with n particles to a final 
state W (V1, Y2» +--+» Ym) With m particles, may be expressed in terms of the 
functions (24.6) as 


Sri = IT; = [to Visa Ym) 


x S(V1, Y2>-+09 Ym | Xis X25. Xn) yix, X23- Xp) 
x d*y, dty, ---dty,, dfx; dx, + d*x,, (24.7) 


where the arguments of the function s(x, X2,..., XN), N=n+m, are 
related to the final state f, denoted now by y1, y2,..., Ym, and separate 
characteristics of the arguments x,, X2,..., X,, are related to the initial 
state i. Such a division of the arguments of a function s(x,, X2,..., Xy) into 
parts which belong to the initial and final states allows us to write the 
arguments in the form of a matrix 





x 











xy X1, X2 X15 Xz X3 
y 
1 
(24.8) 
Yı sO |x) S(Vy | X1, X2) 
Yis Y2 SCV, Y2 | X1) SCV, Y2 | X1, X2) 





Yis Y2» Y3 
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whose columns (y) are related to the final state f, and whose rows (x) to 
the initial state į. Not all the elements whose sequence is given in the table 
are different from zero, and not all describe real physical processes with 
real particles. For example, the interaction (24.4) excludes in general 
the functions s(x|y) with an odd number of arguments. Elements of 
the type s(yı | X1» X2, X3) differ from zero but cannot describe real 
processes because the rest mass of particles (y,) has to be greater than 
3m. This contradicts the assumption on particle spectra, in which only 
particles of rest mass m are allowed. The corresponding element Sp; for 
real particles automatically becomes zero. We shall not consider this char- 
acteristic detail. 

Let us now return to the most important aspect of the problem. We 
assume that we can localize a particle in space and time to any degree of 
accuracy. In other words, we assume that it is possible to take wave func- 
tions y, and w; in the form of a product of four-dimensional 6*-functions 


¥;=TS*0,-y%), W= [I ee, - x). (24.9) 
q=1 q=l 
Then the integration in (24.7) may be immediately carried out and we 
obtain 


Sri = sYb Vos -+ Vin | Xi X25 009 Xp) (24.10) 


From this it follows that the function s(y|x) in (24.8) gives the most com- 
plete information on the space-time development of the process of par- 
ticle interaction N =m +n. Thus it is this function that must be considered 
to obtain a complete description of the structure of scalar, elementary 
particles which interact with each other by emitting and absorbing par- 
ticles similar to themselves. We may restrict the scope of our description 
which is related to the concept of particle structure to only those func- 
tions s(y|x) from (24.8) that describe elastic scattering, namely, the func- 
tion s(y1, ¥2|x1, X2). In this case we may consider that one of the par- 
ticles probes the structure of another. This point of view is of course pos- 
sible, but not obligatory. In one way or another, the description of par- 
ticle structure based on §-matrices has a dynamic character that cannot be 
separated from the processes that cause the structure to become apparent. 

This dynamic concept of space-time structure of a particle is shown in 
Figure 22, using the Feynman diagrams for the special case ¥ =g6*. 
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The function s(y;, y2 | X1, X2) is used as an example. This function is 
represented by the sum of contributions of the operator (24.4) in powers 
of g. Each of these contributions is represented as a sum of Feynman dia- 
grams whose number equals the power of g. The first diagram (a) describes 
the interaction (elastic scattering) of point particles. The other diagrams 
include virtual particles which form clouds around the original particles 
(x1, X2 | Y1, Y2). This cloud has the following dynamic meaning: At 
each of the vertices particles produce other particles which as a result 
of successive transformations, begin as the initial particles x, and x3, 
and result in the final particles y, and y,. Summing all the diagrams 
and averaging them over vacuum, we account for all possible paths which 
lead from (x,, x2) to (1, Y2). 

Returning now to real particles, we must consider the three types of 
interaction. One particle may take part in more than one interaction. 
For example, a proton interacts strongly with pions, electromagnetically 
with electrons, and weakly with neutrinos. These interactions correspond 
to different S-matrices, and the nature of the particle structure which is 
reflected in each of these interactions may be unique to one of the inter- 
actions. It is highly probable that in the future the inner relationships in 
these interactions will be found, and then the various structures will be 
understood as manifestations of the same phenomenon. t 

Furthermore, we must remember that the concepts of the space-time 
structure of particles presented in this section are based on theory which 
is only well founded for electromagnetic interactions. 

Mathematically, this concept is based on the set of functions s(y|x) 
[see (24.8)], which may be obtained theoretically, but are unobservable 
quantities. The fact that they are not observable follows from the fact 
that it is impossible to construct four-dimensional 6-like wave packets 
in space-time (24.9) and to use them as functions of initial and final state. 

In reality the initial state y; and the final state y, are superpositions 
of plane waves for which p= +(m?+p?)'/*. Therefore, information 
about the function s(y|x), which is obtained from knowledge of the 
matrix elements S;; of the scattering matrix S, is extremely limited. The 
essence of the matter is contained in the fact that the spectral decomposi- 


t In atomic physics the nuclear electron structure is the only basis for understanding of 
the behavior of atoms in the light of both X-rays and electron beams. 
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tion of the function s,; in momentum space R(p) composes only a small 
part of the spectral decomposition of the function s(y|x).t 

In order to consider this question in greater detail, let us return to the 
real case of elastic scattering of z-mesons on nucleons. Because of the 
difference of these particles, we shall write the function s(y|x) in the form 


S(y| x) = s Y2 x2| X1, y1), (24.11) 


where x, is the coordinate of the nucleon in the initial state 


Wi (x1) = u; (py) i, (24.12) 
and x, is its coordinate in the final state 
Vi, (x?) = üp (p2) e7, (24.13) 
yı and y, are the z-meson coordinates in the initial and final states 
etk: = e7 ikzy2 
py) ==>; P0) =——. (24.14) 


Let us now introduce the coordinates 


x= xX — X7, é =}X -4y + ya), 
, l (24.15) 
X =3(x, + x2), 2 =2X — Yı — y2), 
so that 
s (Yz X2 | Yo x1) = s(x, Ei, &). (24.16) 


This last function has the spectral decomposition 


s(x, či, 2) = fie By, B2) exp [i (ax + Bič + B2¢62)] 
x dta d*B, d*B,, (24.17) 


where the integration over the variables x, $4, 2 is not restricted to the 
mass surface. 

An evaluation of the matrix element S ;; of the function (24.17) for the 
states 


We = Wy (X2) Ps (V2), Wi = Wi (x1) pW) (24.18) 


t We already met a similar situation in studying the vertex I’, (X, x) (see Section 22). 
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leads to the result 
Spi = ôt (pp + ky — pı — kı) üp (p2) 
y 3(— pı — ky, — q, ky + żq) 


Thus the function 3(«, $4, P2) is known only for the values «= — (p, +k,), 


Bı = —q, P2=k; +34. 
The volume element dfa d4ß, d4ß, in (24.17) is 


dta d*B, d*B, = dtp, dtk, dq 
_ bdu dtp, m dm d*k, 
~ EN E 





u; (pı). (24.19) 





dqo d3q. (24.20) 
where Ey is the energy of the initial nucleon in the center of gravity 
system, E, is the energy of the initial meson in the same coordinate 
system. A substitution of s from (24.19) into (24.17) does not give the 
complete function s(x, €,, €,), but only its “visible” part 4s as 


1 
As (x, Či, é= 40 (s(n: —k,, — 4 kı +34) 


x ap] - i(py + kı) xı — iq¢č, 


rs i(k + 2) a| d3q, (24.21) 


where 4Q denotes all the multipliers in (24.20) except for d3q. 

We now consider the special case, assuming x = 0, č, =0 in which the 
the variable € takes on the meaning of the distance between the nucleon 
and meson, in which 


As (0, č, 0) = aa sin — ky, — q, kı + 24) 
x exp(— iqë) d3q. (24.22) 


If we restrict ourselves to the region in which k, +4q~ky, i.e., the case 
of a high-energy initial meson and small momentum transfer q?, then 
in (24.20) we may neglect q in the expression $, =k, +4. Then (24.21) 
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may be considered as a Fourier transform for the variable q. We write 


s(- Py ky, — 4, K1)ao=0 = 5(E, q’), (24.23) 
where E=Ey+E, is the total energy of the system. The quantity 

As (0, &, 0 ; 

ae = f: (E, g?) è% d?q (24.24) 


gives us an idea about the structure of nucleon-meson interaction. How- 
ever, using (24.19), it is more convenient to introduce the quantity 





V (E, 6) = | V (E, q) e% d?q (24.25) 
(where 
~ 2 
V(E,q) =~ E i N, (2426) 


which is analogous to the interaction potential of the meson and nucleon. 
In fact, in the limiting case of a small potential V (E, č) the scattering 
amplitude may be written in the form 


T zi (E, q) = ü; (Py) Ÿ (E, q) u (pì), (24.27) 


which agrees with (24.19) if the definition (24.26) is used. 

We call the quantity V (E, &) in (24.25) the effective potential of a 
meson-nucleon interaction [50]. 

Figure 23 shows experimental data for the differential elastic (x*p) 
scattering cross section. If we consider the (x* p) scattering amplitude as 
purely imaginary, its real part being small, then we can construct two 
curves to approximate this experimental data. These curves correspond 
to the two potentials (24.27): 


2 
V (E, &) = i% (E) exp| - =I. (24.28) 


exp(— ¢/a) 


T (24.28’) 


V (E, ¢) = iV (E) 


The first of these equations agrees better with experimental data than the 
second (see Figure 23). The length a in both cases is a=0.6/0.7 x 107 +°. 


FINITE DIMENSIONS OF ELEMENTARY PARTICLES 123 





-~ 
~ 
-I 








T panes teas i T 
0975 0950 0925 0900 
cos 8 


Fig. 23. A comparison of theoretical curves for the differential elastic scattering cross 

section da/dQ for pions on protons as a function of the scattering angle 0. The solid 

curve is derived from the effective potential (24.28) and the dotted curve from the po- 
tential (24.28’). 


Such a value corresponds to the interchange of two z-mesons or a vector 
p-meson. 

Thus the “effective potential” V (E, €) gives an idea about the structure 
of a nucleon, which agrees with the concept of a meson atmosphere of a 
nucleon. 


CHAPTER V 
CAUSALITY IN QUANTUM THEORY 


25. A FEW REMARKS ON CAUSALITY IN THE 
CLASSICAL THEORY OF FIELDS 


The concept of ordering of events contains within it the most important 
concept of causality which is based on the chronogeometric division of a 
manifold of events into time-like and space-like parts. Therefore, causal- 
ity may be considered as a geometric category and the study of causal 
problems is only one of the possible aspects of geometric analysis. We 
shall begin our analysis of causality in the microworld with a few remarks 
on the classical theory of fields. 

In the classical theory of linear fields, causality is expressed most direct- 
ly using a Green’s function © which represents a field Qy(x), arising 
from a point source localized in space and time 


Qo (x) = — 64 (x) = — ô (t) & (x). (25.1) 


For a scalar field, the Green’s function satisfies the inhomogeneous 
Klein equation 


(O? — m3) © = — d(x). (25.2) 


Following Appendix I [see (1.12)-(1.14)], we obtain the retarded Green’s 
function G,., 


_ JD(x) for t>0, 
Sat =] 0 EY (25.3) 


where the function D (x) coincides with the commutation function defined 
in (1.3), and the advanced Green’s function 6,4, 


0 for t>0, 


=D? for t<0. (25.3’) 


Gaav (x) = | 


These equations reduce to a very simple form for the case m=0, which 
corresponds, in quantum theory, to particles with zero rest mass. In this 
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case we obtain from (1.3) 
1 6(¢-r) 


— fi t>0, 
Gal) =j r nas (25.4) 

0 for t<0O 

and 

0 for t>0, 
= 1 ô f 
Gal) 5) LEH r <o. (25.4') 

4n r 


The solution G,,,(x) represents an infinitely narrow impulse which 
spreads out from the point r=0. The solution 6,,,(x) represents the 
same impulse that converges into the point r =0 (see Figure 24). The first 
solution describes the impulse which transmits the effect of an event 
(impulse) from the point r=0 to distant points. The second solution 


(a) (b) 


Fig. 24. (a) An infinitely narrow impulse that diverges from the point r=0. (b) An 
infinitely narrow impulse that converges at the point r=0 


transmits the effect of an event located outside of the point r=0 to this 
point. Where the particle rest mass differs from zero, following this con- 
centrated impulse there extends a “tail” of slower effects which are 
described by the terms in the function J, (—im(x?)'/?)/(im(x)!7) [see 
(1.3)]. In corpuscular theory, the existence of this tail might be inter- 
preted as an effect of particles that move with a velocity that is less than 
the velocity of light c. 
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Both of the solutions G,,,(x) and G,4, (x) become zero outside the light 
cone. This indicates that an interaction at the point x, =0 cannot prop- 
agate away from this point with a velocity greater than that of light. 
Similarly it cannot arrive at the point x, =0 from other points x, witha 
velocity exceeding that of light. Three possible effects are shown in Figure 
25. These are (a) the point x, lies inside the light cone, in the future 
relative to the point x, (by convention we let x. > x,); the effect of x, on 
xX, is described by G,,, (x), x =xXz— x, >0; (b) the point x, lies inside the 
cone of absolute future relative to the point x, (by convention x, < x4); 




















XQ > X1 





X2< X4 


(a) (b) (c) 


Fig. 25. An illustration of retarded and advanced interactions. (a) A retarded interaction 

between points x, and x,. The Green’s function 6,,, differs from zero inside the shaded 

region (x, >x). (b) The same situation for an advanced interaction. The Green’s function 

Gaay differs from zero inside the shaded region (x2 < x,). (c) The points x, and x, cannot be 
related by any interaction. 


the effect of x, on x, in this case is described by G,,,(x); (c) when x, 
and x, both lie inside the spatial region, i.e. outside both the light cones 
(by convention x,~x,), both the Green’s functions G,4,(x) and G,,,(x) 
become zero so that there is no physical connection between the points 
x, and x. 

Using the Green’s functions we may write the field @(x) which results 
from any distribution in space and time in terms of a source Q(x). The 
differential equation for such a field is 


(0° — m’) o (x) =- Q(x). (25.5) 
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On the other hand, the retarded Green’s function ©,,,(x,—~,) satisfies 
the equation 


(03 = m’) Gie (X2 — x1) = — ô (x2 — x1). (25.6) 


Multiplying this equation by Q(x,) and integrating over d*x, gives 
(02 — m°) { Gre (X2 — x1) Q (x1) d*x, = Q (x2). (25.7) 


Comparing this with (V.4) shows that the field ọ at the point x, is equal 
to 


P(X2) = Po (x2) + | (x2 — x1) Q (x1) d*x,, (25.8) 


where pọ (x2) is the solution to the homogeneous equation [Eq. (25.5) for 
Q=0]. 

If the field is formed solely by the source Q(x,), then @o(x,)=0, and 
in place of (25.8) we have 


(x2) = | Bret (X2 — x1) Q (x1) d*x,. (25.9) 
In the simpler case, for m=0, on the basis of (25.4) we have 
(x2) = | nan Q (x1) dfx; (25.10) 
(here r= |x, — xl) or 
p(x) = f eens dx,. (25.11) 


From this we see that the field p(x) at the point x,(t,, x2) consists of 
retarded effects that start from the points x, at ft; =¢,—r/c(c=1). This is 
the most clear expression of causality in the classical theory of fields. 

For m#0, according to (1.3) and (25.3) the expression for the field 
@ (x2) is more complicated, namely, 


9 (x2) = | Prac d?x, + = 
r 2 


= (t2 — tı)? — r°) 


(t2 — t4} —r? 





Q(t, x) dt d3x,. (25.12) 


Q 
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The range of integration in the last integral is limited by conditions 
(t2—t,)*>r?, t)—t,>0, in agreement with (25.3). The relationship 
v=r/|t,—t,|<1=c may be considered as the propagation speed of the 
interactions which occur because the “particles” have a velocity v that is 
less than the velocity of light. 

We now assume that there is a charge distribution Q, (x2) in the region 
around x,. We defined the quantity 


W= fo (x2) 9 (x2) d*x, (25.13) 


as the “action function” (the integral over time of the interaction energy). 
Denoting the source of the field in the region x, by Q; (x1), we obtain 


W= fo (x2) © (x2 — x1) Q; (x1) dfx; dfx; , (25.14) 


where G (x2 — x1) = Gye, (%2 — X1) — Gaav (X2 — x1). 

Such a notation accounts symmetrically for the action of one charge 
on another and, because of (25.3) and (25.3’) it assures a delayed effect 
of charge located “earlier”, on a charge located “later”. 

We now take the second functional derivative of the action W with 
respect to the functions of the sources Q, and Q, at the points x, and x,, 
respectively. We obtain 


ôW 
5Q2 (x2) 6Q; (x1) 
From this it follows that 
ôW 0 
ôQ 3 (x2) 6Q, (x1) ‘ 
if x, ~x,, i.e., if the points x, and x, lie with respect to each other outside 
the light cone so that the interval (x,—x,)? is spatial. Equation (25.16) 
expresses the physical fact that the influence of one charge on another 
cannot propagate with a speed greater than that of light. 
In concluding this section we note that the Green’s functions, which 


are convenient for describing causality, are in themselves only mathe- 
matical abstractions because the point source 


Q (x) = gd (t) 6° (x), (25.17) 


G(x. — x). (25.15) 





(25.16) 
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that generates the field described by the Green’s function is inconsistent 
with the concept of charge conservation. The conservation of charge re- 
quires that the continuity equation 


£ +divj=0, (25.18) 


be fulfilled, where j is the current density corresponding to the charge Q. 
From this equation it follows that, in the case of a point source 
(g6(t)d3(x)), the current density j(x) must have the form 


 _ dd) fl 
ix) = -g dr v(-), (25.19) 


div vE) =V? C) = ô? (x). 
r r 


Thus the current which corresponds to the point source Q [see (25.17)] 
appears to be distributed in space, and is highly singular in time (deriv- 
ative of a ô-function). 

A more real case is that of a point source which moves along some 
trajectory 





because 


x(t) = X(t). (25.20) 
The charge and current densities in this case have the form 
Q(x) = gd? [x — X9], (25.21) 
dX (t 
jx) =g xo ô’? [x — X(o]. (25.21’) 


It is easy to show that these quantities satisfy the continuity equation 
(25.18). An insertion of these expressions for Q(x) and j(x) into (25.9) 
gives the scalar and vector potentials 


g 
7 r[x- X(t- r/o] 
g dX (t — r/c)/dt 
r[x—X(t—1r/c)]’ 


where the points ¢, x can be reached by a “signal” from a moving point 





(x) (25.22) 


A(x) = (25.22’) 
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charge that lies on the surface 


R= E 2 x(: a ajj (25.23) 


Figure 26 shows the transmission of a signal from one particle (x,) to 
another (x2) by means of a retarded Green’s function G,,,(x2—). 


Fig. 26. Retarded interaction between two “point” particles; x, =X, (t) and x, =X, (t) 
are the locations of these particles at the time t, =t, =t, x, =X,(t—R/c) is the effective 
location of the particle l. Here R is the distance between the points x; and x3. 


We now consider the potentials g(x) and A(x) at some distance from 
the point X(t—r/c). For definiteness, we assume that the motion of the 
charge originates in the bounded region |x|<M. Then 

e xX (t — r/c 
N CL GIN 
r (x) r 
X(t — 
A(x) = eX tro 
r(x) 


In this region of space R, (x) the potential o (x) does not have any mean- 
ing for causality because the first term e/r (x) is static and, therefore, it is 
not related to the transmission of any signal. The second term is of the 
order O(1/r?). Thus only the vector part of the potential A(x) can trans- 
mit a signal. 


(25.24) 





o (x) 


(25.24’) 
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We further assume that the current j= eX6°(x — X) is generated by an 
oscillator to which motion is imparted by the force F(t) of the form 


: 3 1 
X+yx+@°X =—F(t), (25.25) 

m 
where w is the characteristic frequency of the oscillator; m, its mass; and 


y the coefficient of friction. The most abrupt motions of the oscillator that 
we can conceive arises from an instantaneous blow, i.e., in the case when 


F (t) = Aô (t). (25.26) 
Noting that 
X (t) => | G(t — t) F(t) drt, (25.27) 


where G(f) is the Green’s function of Equation (25.25): 


exp(— (37) t) 
(exp (t./ 47? — 0”) 
y? — 4w? 





G(t)= (25.28) 
— exp(— t,/4)” — w)), t>0, 
0, t<0 
from (25.26) and (25.27) we find that 
A 
X(t) =— Git). (25.29) 
m 
If w> y/2, is a small friction, then 
: A 
X(t) =—e7”** cosat ; (25.30) 
m 
and for a large friction 
Kip = erm, (25.31) 
m 


In the first case, maximum current is obtained for t~ 2/y and the current 
flow lasts for about the same length of time. In the second case, maximum 
current is obtained for t=0, and also flows for a time of the order of 
t~2/y. The width of the impulse 4t is defined in both cases as y/2. 
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In this manner the extremely small impulse can be transmitted by an 
oscillator which has an extremely great friction. This simple estimate 
illustrates the method by which a source that generates short signals 
could in practice be obtained. 

In classical (macroscopic) physics, no fundamental restrictions are 
placed on the dimensions of the regions in which sources are localized, 
or on the frequency of their vibrations, or on their coefficients of friction. 
In particular the charge may be concentrated in an arbitrarily small 
region of space [this is expressed in the computation by the d-function 
ô? (x—X(a))] and may have an arbitrarily small amplitude of oscillation 
A for any frequency w and any coefficient of friction y. This situation, 
which is typical of the consistent theory, does not in itself present any 
limitations. 

The limitations are external ones. Decreasing the size of the sources 
would in the end lead to sources that have molecular dimensions and 
thereby would result in a microworld region. The ability to choose sources 
arbitrarily, independently of their mass, frequency, friction and oscilla- 
tion amplitude would be significantly decreased. Heisenberg’s uncer- 
tainty principle, which relates the space-time localization of the source 
to the uncertainty of its energy AE and momentum 4p, is highly signific- 
ant under these conditions. 


26. CAUSALITY IN QUANTUM FIELD THEORY 


In the quantum theory of fields the propagation of interactions of one 
region R(x,) to another R (x2) is described by using quantized fields. 
The operators which represent these fields satisfy the commutation laws 


[see (14.3) and (16.2)]: 
for the scalar complex field @(x) 

[ (x2), @* (x1)] = iD (x3 — x), (26.1) 
for the scalar real field @(x)=@* (x) 

[ê (x2), @ (x1)] = iD (x2 — xı), (26.2) 


for the vector field 4, (x) 
[A, (x2), A,(x;)] = iD (x2 — x1) 6,, (26.3) 
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and for the spinor field p, (x) 


a (X2)s Wp ED} = Sag (x2 — x1). (26.4) 


The remaining commutators are equal to zero. 

The functions D(x, —x,) and S,ș (x2 —x,) describe the correlation be- 
tween the fields in various points in space-time. An important property 
of these functions is the fact that they vanish for space-like intervals, that 
is for (ta —t,)? — (x2 —x;)? <0. Therefore, the correlation between values 
of the field that are separated by the spatial interval x, ~x; is absent.t 
We may interpret this as an absence of some causal relationship between 
the values of the field, taken in the regions R(x.) and R(x,) which are 
located in a space-like manner next to each other so that for all points 
XER (x2) and x,ER(x,), X2~Xy. 

In this relationship, the commutation conditions * (26.1)-(26.4), which 
are called the conditions of local commutativity, may be considered as 
conditions of microcausality in quantum field theory. We note that 
other formulations of microcausality, which we shall describe later, 
imply in one way or another that the conditions of local commutativity 
of the form (26.1)-(26.4) are fulfilled. 

The quantum field operators may be divided into two parts; one 
which contains the operators of particle generation (this is the positive 
frequency part) and the other which contains the operators of particle 
annihilation (this is the negative frequency part). In order that we 
may not make the following discussion cumbersome by using super- 
fluous notation and indices, we turn to the case of a real scalar field @ (x). 
The expansion for this field into a positive frequency part ĝ* (x) and a 
negative frequency part @ (x) has the form 


G(x) = ĝ* (x) + @ (x), (26.5) 
At i (wt — kx) 
@t (x)= | aa a dk, (26.6) 
w 


t? In the case of the spinor field, the anti-commutator {A, B} and not the commutator 
[A, B] becomes zero. However, using (26.4) it is easy to show that when physically observ- 
able bilinear combinations of the form w(x) rý (x) (where I is some spinor matrix) are 
are taken at two points x, and x,, their commutator vanishes. 

t We shall not discuss here axiomatic studies which allow somewhat less stringent condi- 
tions of commutativity with vacuum averages. 
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i A k —i(œt-— kx) 
p` (x)= [oss dk, (26.6’) 


J 


where w=(k? + m?)!/?, From (26.6) and (26.6’) we may see that ĝ* (x) 
contains only the generation operators â* (k) and @~ (x) contains only 
the annihilation operators d(k).1 Therefore the operator @* (x,) is an 
operator of particle generation at the point x,, and the operator @~ (x2) is 
an operator of particle absorption at the point x. The operators @~ (x3) 
* (x,) will be the operator of a particle propagating from the point x, 
where it is formed, to the point x, where it is absorbed. For this case, 
t,>t,. For t,<t,, the operator ` (x1) @* (x2) describes the same 
situation for t, > t32. 

The matrix element of these operators are influence functions in the 
quantum theory of fields. The only elements that differ from zero are the 


vacuum averages, i.e., 
(Pol @ (x2) G* (x1) |Po> for t2>t,, (26.7) 
Po ÔT (x1) @* (x2) |Po> for t2< ty. (26.7') 


Here wo is a functional of the field which describes the vacuum state. 
Both expressions may be combined into one if we utilize the concept of 
chronological ordering of the products, “the 7-products,” of two oper- 
ators. By definition, the 7-product of a field @(x,) taken at the point x, 
and the field @(x,) taken at the point x, is equal to 


A A _ JÊ) (x) if t >t, (26.8) 
TORERE pene if ty <n. (2638) 


Note that the 7-product is relativistically invariant. In fact, for the 
time-like interval x,2x,, the inequality ¢,;272, is invariant with respect 
to the Lorentz transformation. For the space-like interval x.~x,, the 
operators @(x,) and (x2), according to (26.2), commute and therefore 
their order in (26.3) is unimportant. Noting that @~ (x) Wo(0)=0 (the 
“stability of a vacuum’’) it is easy to see that both the equations (26.7) and 
(26.7’) are united in the equation 


<Pol TÊ (x2) Ê (x1) |Po> = iD. (x2 — x1), (26.9) 


t It is possible to expand any quantized field ĝ, A, etc. in this manner. 
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where D,(x) is the causal function that is defined in Appendix I [see (I.5)]. 
The physical meaning of the equality (26.9) is contained in the fact that 
the left-hand side is the average of all processes in which a particle is 
produced at one point and absorbed at another. 

The function D,(x) has the form [see (I.5)] 





D. 6) = + 50?) = 00) 


= [h om 2) — iN, (mn / 
82 ./x 


+ LY ERA g (m./— x?). (26.10) 


4n?./— x? 


This function accounts symmetrically for the advanced and retarded 
interactions, similarly to the classical Green’s function G (x) in (25.14). It 
differs significantly from the classical action function in that it is not 
equal to zero, not even outside the light cone. In fact, for x? =(t,—t,)” 
—(x,—x,)? <0, from (26.10) it follows that 


im 
D. (x) = ————— K, (m,/-— x’). (26.10) 
yaw 


4x /— x? 


Note that for m(—.x?)'/?> 1, this expression takes on the form 


A 1 1/2 mae 
Do G ——] en T, (26.10”) 


4n./— x? 2 m./— 2 
that is, outside the light cone the function D,(x) decreases exponentially 
over a distance of a Compton wavelength of the particle h/mc [in equa- 
tions (26.10), (26.10) and (26.10”) we use h=1, c= 1). 
For the case m=0, outside the light cone we have 


i 
D. (x) = tax’ (26.11) 
that is, D,(x) decreases outside the light cone more slowly than for m<0 
[see Appendix I, Equations (1.6) and (I.8)]. 

At first glance, this characteristic of the function D,(x) may appear 
to contradict causality, insofar as interactions can propagate outside 
the light cone. We shall now show that because of Heisenberg’s un- 
certainty principle, such a contradiction does not in fact arise. To prove 
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this important proposition we consider two microsystems (particles 
or particle systems) A and B that are separated from each other in 
space. 

Let an event a occur in system A, and an event b in system B. Let 
event a precede event b so that x,<.x,, where x, is the coordinate of the 
event a. We assume that events a and b may be distributed in space time 
regions R, and R,, and so x,, x, are points in these regions (all or part). 

We shall consider that b is a consequence of event a (cause), if the system 
A as a result of the event a transmits a “signal” to the system B which 
initiates the event b in B. Thus we assume that in this causal interaction 
process the energy E, of the system A decreases (it looses energy by send- 
ing out the signal), and the energy Ep of the system B increases by receiv- 
ing the signal. Correspondingly, the momenta of the systems must change. 
In other words, the signal carries a positive energy e=hw and a cor- 
responding momentum p= hk. These assertions have meaning only in the 
case that the energy spread AE and momentum spread Ap x AE/v (v is the 
velocity of the system) in the systems A and B are small in comparison to 
the quantum energy and the momentum 


AE, < he, AEs < ho, (26.12) 
h h hook 

Mee, E (26.12) 
L c L c 


where L denotes the dimensions of the region in which the microsystems A 
and B are located (we consider these dimensions to be the same). If 
these inequalities are not fulfilled, we cannot specify which of the 
systems, A or B, sends out the signal and which absorbs it. 

We see that a coordination of events in terms of cause and effect 
requires simultaneous space-time localization of microsystems and that 
the energies E and momenta p be sufficiently well defined. This fact has 
a fundamental meaning for causal problems in the microworld. We may 
study the causal relationship only between two wave packets which are 
diffuse and are not sharply defined objects in space-time. Therefore, the 
causal relationship and causal order of events in the microworld may be 
determined only to within the limits of certain “tolerances”, the magni- 
tudes of which are defined by the spatial dimensions of the wave packet 
Ax= L. This uncertainty in spatial dimensions corresponds to the un- 


t In the opposite case we must consider A + B as one system C that cannot be divided. 
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certainty in time At2Ax/c. These considerations have an important 
meaning for the definition of the concept of macroscopic causality in 
elementary particle theory (see Chapter 6). 

We shall now illustrate these general considerations with a simple 
example. We shall consider two wave packets (see Figure 27a). The 
first packet is located inside the space-time region R(x,) and has a 
spatial extent of Axx L and a time duration At% T. We shall consider 
that this wave packet has an energy E to an accuracy of AE~h/T and a 





(a) (b) 


Fig. 27. (a) The packet in the region R(x.) is located later than the packet located in the 
region R(x,) (R(x2)>R(x,)). The interaction of the packets depends on the packet in 
R(x,) emitting a quantum of energy hw (> 0) which is then absorbed by the packet in R (x3). 
(b) The packet in (x2) is located earlier that the packet in R(x,) (R(x2) <R(x,)). In this 
case the interaction takes place by the packet in R (x,) emitting a quantum of negative energy 
ha(<0) and this being absorbed by the packet in R (x2). This process is equivalent to the 
packet in R(x,) emitting a quantum with positive energy which is then absorbed by the 
packet in R(x,). 


momentum P, to an accuracy AP, ~h/L (we assume that £,>4E,, 
P,>AP,). The second packet is concentrated in the space-time region 
R(x). We consider this packet as having the same length L (and 
duration), an energy E > AE, %hħ/T and a momentum P,>4P,~h/L. 

The charge of the first packet Q(x,), which produces the field ọ (x2), 
may be written in the form 


O(x,)=9 (3) p (=) eaii, (26.13) 
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where the spatial part — the function p(x,) — differs significantly from 
zero in the region R, (x,) =L’ near the point x,. œ is the characteristic 
frequency of this quantum jump which occurs in the wave packet. The 
function g(t,/T) differs significantly from zero in the region R; (t,)T, 
R (x1) = R; (x1) x Rı (t1). We may consider this as the magnitude of the 
charge of the wave packet, which switches on and off adiabatically (i.e., 
ina time T> 1/q@), about the moment in time t, =0. We may, for example, 
write 


t) 1 T x\ 1 L 
(i-i (=i eet) 

This process of adiabatic switching on and off of a charge violates the 
law of charge conservation and, therefore, it has a purely formal mean- 
ing. We resort to this artificial process only to separate out the effect 
which is produced by the field source, not only from the region of space 
®,(x,)=L, but also from the specific time ~ T. 

A far more realistic case would be that of a wave packet which moves 
along some trajectory. Then the “switching on” and “switching off’ of 
the region ®;(x,) would automatically take place in the time Tx L/v, 
where v is the velocity of the packet. The adiabatic nature of the switch- 
ing on and off of the region R, (x,) will be generated if 


L 1 
Tz=>— (26.15) 
v w 
or 
L> oA, (26.15) 


where 7 is the wavelength of the quantum that is emitted or absorbed by 
the wave packet. 

We shall now consider the quantum analogue of the action function 
W (25.14). By definition the causal function (26.9), in quantum theory, 
plays the part of the Green’s function which relates events in one region 
of space-time with events in another. Therefore, 


W = i foa Dz (x2 — x1) Q(x) d*x, dx, . (26.16) 
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From this we obtain 
ow 
6Q (x2) ÔQ (x1) 
and in contrast to (25.14) this quantity does not vanish outside of the 
light cone. 


In order to illustrate the properties of the quantity W, we compute 
the field p(x) produced by the packet Q(x,) in the region R(x,): 


9 (X2) = Je. (x2 — x1) O(x,) d*x,).1 (26.18) 


The most weakly attenuated function D, (x) outside of the light cone is 
one for m=0, that is for a field which does not have particles whose 
rest mass differs from zero. Therefore, we shall use this case in the 
following computations. 

For m=0, according to (1.6) and (1.8) we have 


= D, (x2 — xı) (26.17) 


D, (x2 — x1) = Öki (x2 — x1) + Spay (X2 — xı) 


1 : 
ro [ew Liv (t3 — t; — r)] dv 
0 


1 
-— [ex [— iv(t, —t, +r)] dv, (26.19) 
2n 
0 
where r= |x,—x,|. Placing this expression for D,(x,—x,) in equation 
(26.18), and using (26.13) we obtain 


ce +o 
1 x,/L) d°x, f dv 
paje p(x,/L) | f an 
8x r 2n 
0 =o 


x {| exp (ict, + iv (t, — t,)) exp(— ior) a()] 


+ (x Gwt, — iv(t, — tı) — iwr)g ())}. (26.20) 


t Because of the symmetry properties of the function D.(x,—x,) we could also have 
computed the field @(x,) which is produced by the charge Q(x,) in R(x,). The result 
would be symmetric with respect to the regions R(x,) and R(x,). 
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Carrying out the integration we obtain 


00) = [eee a°x, exp[io( — 1) (2 > r) 
T r 








T 
1 [{ p(x,/L) 3 é tatr 
Par ae x, exp[—iw(t, +n] g 7S 
(26.21) 


In order to separate the events in the space-time regions R(x,) and R(x.) 
it is necessary that t, >T and rẹ L. From this it follows that for t,>0, 
only the first term in (26.21) is important for it is the only term that 
differs noticeably from zero for t, =r (to an accuracy At% T). Therefore, 
this term corresponds to the retarded interaction (t,=r/c, c=1 is the 
speed of light). The condition rẹ L leads to the inequality wr/c>1, 
Le.,r>Z. 

In this manner the necessity for space-time localization to be com- 
patible with the assumption that the energy and momentum of a wave 
packet may be sufficiently well defined, leads automatically to the fact 
that causality is distinguished only in a wave zone. 

We must still consider the case t, <0. This does not lead to anything 
substantially new. In this case the second term in (26.20), which describes 
advanced effects, is important. However, from (26.21) we see that the 
sign of the frequency œ of the quantum which transfers the interaction 
changes to the opposite sign. Therefore, for t, <0 in the region R(x,) a 
quantum is emitted with negative energy from the region R(x,) and goes 
to the region R (x2) by the advanced field. The packet Q(x.) “absorbs” 
the quantum with negative energy —fw. This description is obviously 
equivalent to the assertion that for t, <0 the wave packet Q(x2) emits a 
quantum with positive energy which goes over to the retarded wave 
packet where it is absorbed. 

Thus the function D,(x,—.x,) describes a process of interaction by 
means of a signal with positive energy which is symmetric with respect 
to both the space-time regions R (x,) and R (x2). This process is illustrated 
in Figure 27. As for the “anomalous” behavior of the causal function 
D,(x2—,) outside the light cone, it is obvious from the foregoing cal- 
culations that the uncertainty relationship does not allow particles to be 
found violating any of the laws of causality [52, 53]. 
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27. THE PROPAGATION OF A SIGNAL “INSIDE” 
A MICROPARTICLE 


In the preceding section we showed that although the causal function 
D,(x) defining the propagation of interactions in the quantum field 
theory does not vanish outside the light cone, it is nevertheless impossible 
to find signals propagating with velocities greater than the velocity of 
light, particularly with infinitely large velocities. 

This “paradox” of the D,-function was investigated for the case of 
the interaction between two point particles. 

A similar paradox arises for microparticles that have finite dimensions. 
The essence of the matter is contained in the fact that the form factor of 
the particles F(q) (Section 22), where q is the momentum transfer, 
does not vanish for spacelike values of g and therefore a rigid spatial 
distribution of charges and currents may be assigned to the particle 


p(x) = [Fo el dq. (27.1) 


These are able to transmit the signal with an infinitely high speed. In 
order to explain this situation, we limit our investigation to the inter- 
action of a virtual photon with a spinor particle. 

We shall describe the photon field k by the vector potential 4,(x) 
and the spinor field of the particle, which has a mass, by the operator 
@ (x). To the first approximation in the charge e, the interaction operator 
may be written in the form 


W= fie J, (x3, X2, X1) Ý (x1) A, (x2) d*x, dfx, d*x,, 
(27.2) 


where J (x3, X2, X,) is the density of the current induced by the spinor 
particle (see Figure 28). The amplitude of the probability of the particle 
transition from the state i to the state f caused by this interaction is equal 
to 


Ty = > <f| W iD = fen J, (X2) li> Ay (x2) d*x2, (27.3) 


where ¢ f|J,,(x2)| i> is the matrix element of the current density J, (x35 
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A(x) 


X2 


Y y x3 
x 
Fig. 28. A graphical representation of the current J, (x3, X2, X1) for a spinor particle which 


has finite dimensions. x, is the particle entry, x, its exit, and at x3, it interacts with the 
external field A(x). 


X2, X,) which refers to the transition i>/: 


<fl J, (x2) liò = ZS J, (x3, X2, X1) Yi (x1) d’x3 d°x, . 
(27.4) 


We shall consider the wave functions J, and y; as arbitrary with the 
restriction that they are superpositions of states with positive energies: 


Uy (x3) = | ct (p3) üş (p3) exp(— ip3x3) d*p3 ’ (27.5) 


W (x1) = fe (p1) u (pı) exp (iprx1) d°Py ; (27.5) 


Here ùü, u; are Dirac’s bispinors, Cf, c; are amplitudes of states with 
momenta p3, p, and p$, p?>0. Denoting p;—p,=q and using the re- 
lationship 


<P3l J, (x2) IP = fa ip3X3) a, (x3, X2, X1) 


x exp(— ipıxı)dtx; d*x, d*x, 
= J, (q) exp (igx2), (27.6) 
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we obtain from (27.4), (27.5) and (27.5’) 


<fl J, (x2) lid = fa (P3) ¢;(P1) ur (P3) J, (q) 4; (P1) d*p, d*p; > 
(27.7) 


where J (q) is the current operator in the momentum representation. 
According to (22.15) this operator may be written in the form 


J,(Q) = eF, (47) Y, + HOw 4yF 2(q’), (27.8) 


where e is the charge of the particle, u is its magnetic moment and F; (q°) 
F,(q’) are the electric and magnetic form factors of the particle. When 
defined in this manner, the quantities F, and F, are dimensionless and 
must depend only on the ratio q?/q2, where qm is some value of q which 
specifies the rate of decrease of F, and F, for |g|-0o. The particle 
dimension R is related to qm by the equation R=h/qp,. 

In order that we may investigate the propagation of a signal inside a 
particle, we shall consider the states y, and y; as localized wave packets. 
We shall also consider the photon field A, as localized.' For definitness, 
we write tis 

cs (p) = -3z exp {— 2 bF (Pp — Py)” — ipxy}, (27.9) 


3/2 


H 
ci(p) = -37 exp { —3 b? (P — Pi)? — ipx,}, (27.9') 
pee a, exp {— i[k(x — X) — ko (t — T)]}} 
1 1 
x apf- 5,2 -XP -— zZ [x —(X- ony. 


(27.9") 


The wave functions (27.5) and (27.5’) represent wave packets which have 
average momenta p; and p,, and are localized at t=0 about x=x,, x 


with rms deviations 
2 


po 
4(P— Py) = Fe 


where the coefficients « and Bx 1, and m is the particle mass (see Section 
13). The same relationship holds for (27.9’). 


ete h 
A(x — x,)? > ab? + Bobs, (27.10) 


t See Appendix VIII, Equation (VIII.4). 
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The field A,(x) represents a wave packet of virtual photons with 
the characteristic wave vector k(k, kg) and “velocity” of propagation 
u, which is localized at T=0 around x =X in a region whose width is of 
the order of Axxa, Atxa/c. The choice of this form for the virtual 
photon packet corresponds to the assumption that it could be emitted 
by a real physical particle. 

We note that if we could have used infinitely narrow packets in 
space and time described by the 6-functions ý -(x3) =65* (x3 — xp), Yi (xı) 
=6*(x,—x,), A% ôt (xı— X), then from (27.2)127.4) the amplitude of 
the transition would be immediately equal to J, (x,, X, x;), and we would 
have established the fact that signals can propagate with the forbidden 
velocity. 

We may not, however, use wave packets whose dimensions are arbi- 
trarily small. In fact the ability of a particle to receive signals is contained 
in the established fact that its momentum changes from p;—-p,. This 
change has to be greater than the momentum dispersion Ap,~h/b,, 
Ap,;h/b,, Ak~h/a in the initial and final packets, i.e., (considering 
b,xb;~b) 


h h 
IPs — Pil = lad > p |k| > -. (27.11) 
a 
From this we obtain the inequality 


h 
IP rl, Ipi] > p (27.11’) 


On the other hand, the dimensions of all three of these packets must be 
smaller than the dimensions of the particles. 


by & R, b; &R,a &R. (27.12) 


Because the conditions (27.11)-(27.12) do not contradict quantum 
mechanics, then at first glance it appears that a causal possibility exists 
of detecting a signal that travels inside the particle with a velocity 
that is greater than that of light. However, further calculations show 
that this is in fact an illusion. 

In order to prove this assertion, we calculate the probability amplitudes 
T,; for the wave packets w,, Y; and A that satisfy the conditions (27.11), 
(27.11’) and (27.12). This amplitude defines the intensity of the signal 
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that is received at the center of the particle and in its peripheral regions. 
According to (27.3) we have 
i b? 3 3n g3 
Ti a d°x dt, | dp; d°p, 
x exp {— 4b? [P3 — P)? + (Pı — Pi)”] + igx2} 
x uF (p3) F (q) u: (pi) exp [ — ik (x2 — x,)] 


1 
x exp| - za i: — XP + [(x, — ut) — (X — un | 
(27.13) 


In order to avoid unnecessary calculations we set x;=x,=0 and by, b; 
=b. We shall also denote F (q)=a,74J,(9). If the dimensions of the 
packets b and a are much greater than the quantities h/p,, ħ/p; and h/k, 
then the integration in (27.13) may be performed immediately and it 
gives the usual result of scattering theory: 


2ni 4 
Ta = E 8t (k = D) uf (Pp) F (ar) u (P): (27.14) 


However the coordinates of the place from which the signal is emitted 
(point x,) and the places at which it is received (points x, and x3) do not 
appear in this equation at all and, therefore, they remain unidentified. 
We now turn to the case of packets having small dimensions a and b 
which nevertheless satisfy the conditions (27.11), (27.11’) and (27.12). 
Because of the assumption |g,;|>h/b> qm (the momentum of the particle), 
the factor F(q) may be taken out from the integral. We also take the 
slowly varying quantities uf (pp), u;(p;) out from under the integral. 
Further, we note (see Appendix VIII) that 


qo = E(ps)- E(P) = E- E +V- VM+, (27.15) 
where 


E =P -Po NSP Pe V =VE, Vi= VE. (27.15) 
Integrating now over č and y we obtain 
i : 
Ty; = h uF (D5) F (api) u (pi) exp (KX — koT)] fex die PY, 


(27.16) 
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where 


; ; 1 
(x, 1) = i(k — qx — ilko = qo) t + 555 — Vit? 


1 1 
+ aq2 -XP+ z7 Lx — ut) — (X —uT)]? 
(27.17) 


where the notation has been shortened to q,;=q, Es — E= qo- 
In this manner the amplitude T; is defined by four wave packets which 
overlap in space and time. This situation is illustrated in Figure 29. It is 





Fig. 29. An illustration of a signal propagation inside a microparticle. Overlapping wave 
packets in the initial (p,) and final (p,) states resemble a 6*-like source. A represents a 
packet of photons. The interaction arises when all these packets overlap. 


essential that these packets at the points x, t~0 and x»X, ¢~T are not 
only bounded in space but also in time so that they to a certain extent 
resemble ô like signals of the form ôt (x). 

Performing the last integral in (27.16) we find that 


Tp = uf (By) F (ay) u; (P) exp [i KX — koT)] 


x exp[— ¥(X, T)], (27.18) 
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where W(X, T) is equal to (see Appendix VIII) 





1 C? 
Y(X, T) = 5a X? + 55K - ary - ap OX — ul]? 
A? 2 
a (X — uT, u? — a [(2X — uT) 


C? 
x (X — uT, u)] + Fa (k—ay 
A? 3 
+ Falko ~ do}? + Im Y. (27.19) 


The coefficients 4?, B?, C?, A? are functions of the widths of the 
packets a and b and of the three velocities V;, V, u. This quadratic form 
is positive definite and the exponential factor exp[— ¥ (X, t)]—0 for 
large |X| as exp[— X?/d?] where d? xa? (or ~ b°). This indicates that the 
transfer of a signal from the periphery to the center of a particle may be 
observed only if the distance between the periphery and the center |X] is 
not greater than the width of the packets a or b. 

However, in this case, the wave packets overlap strongly inside the 
particle and the momentum transfer takes place by direct contact. The 
fictitious spatial distribution of currents J,(x3, X2, xı) which could 
transmit a signal with infinitely high speeds does not play any role in this 
case. It is also impossible to observe a spatially extended particle trans- 
mitting a signal which has a velocity greater than that of light [54]. 

We may consider the results of Section 26 and the present section as 
showing that in the framework of quantum field theory, it is possible 
to introduce deviations from the accepted form of causality which, 
because of the uncertainty relation, will be compatible with the classical 
concepts of causality. 


28. MICROCAUSALITY IN QUANTUM FIELD THEORY 


In quantum theory, the state of a system is defined by the wave function 
WY, which is a vector in Hilbert space and depends both on time and the 
dynamic variables Q,, Q2, Q3,..., Qn that characterize the system. In the 
case of the field theory, the number of such dynamic variables is infinitely 
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large (N03) because the instantaneous state of fields in all space 
R, (x) has to be defined in terms of these variables. Thus in this case 
the wave function is a functional and depends on some functions A(x), 
B(X),..., which characterize the field. Furthermore, from relativistic 
considerations it is advisable to consider ¥, not as function of ¢, but as 
a functional of some space-like surface c =o(x), which only in a special 
case degenerates to the plane t=o(x)=constant. 

To stress the functional dependence of the wave functional ¥ on the 


surface o(x) and fields A(x), B(x), ..., we write the arguments of ¥ in 
curly brackets. 
Y = Y {o (x), A(x), B(x),...}. (28.1) 


The normalization of this functional is defined by the condition 
(F, Y= for dQ {A (x), B(x),...}, (28.2) 


where the integral is taken over the space of functions A(x), B(x), ..., 
must be conserved in time; in other words it must not depend on the 
choice of surface o (x). 

We now consider the value of the field functional ¥ on the surface 
oa’ which differs from the surface o only in the neighborhood of the point 
x and it lies after ø (see Figure 30). If the difference between the surfaces is 
infinitely small 


o' (x) = a(x) + ôo (x), (28.3) 


a(x) =0 (x) +50 Ode cry 


x x+dx 


Fig. 30. The space-like surface o(x) and its variation in the region (x, x +dx). 


then we may write the new functional Y'= ¥ {o’(x), A(x), B(x),...} in 
terms of the functional derivative 6¥/da(x) of the functional on the 
surface a(x) at the point x (see Appendix IX) 

OY {0, A, B,...} 


3 
5a) 60(x)d°x. 


W = Y {o'} = ¥ {o, A, B,...} -f 
(28.4) 
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We note that the quantity ôo (x) d?x is an infinitely small fourdimens- 
ional volume contained between the surfaces ø and a’ (see Figure 30). 

We now turn to the principle of causality which we shall formulate 
classically: the state W’ is uniquely defined in terms of the state ¥ which 
precedes it. From this is follows that the increment of the functional 
defined in (28.4) must be uniquely defined by the same functionals. 

Furthermore, because the theory is linear, the relationship between 
increment of the functional Y¥’— and the functional ¥ itself must be 
linear. On the basis of these arguments and using the definition of a 
functional derivative, we write 





= in, 28.5 
FE į (28.5) 
where # is a linear Hermitian operator. The factor i is dictated by the 
requirement that the norm of the functional (¥, W) be constant. In fact 
by varying the norm (¥, P) it is easy to show that the variation 6(¥, Y) 
is equal to zero only in the case when the operator iff is anti-hermitian, 


i.e., if (if) * = —iĝ* = —if so that the operator 4 is Hermitian. It may be 
written in the form 
h=—Whh, (28.6) 


where W is by definition the Hermitian operator of the energy of inter- 
action of the field and it is a function of the operators of these fields 
A(x), B(x),.... 

Equation (28.6) may now be written in the form 


OV x 


in— aoe (28.7) 





This is the well-known Tomonaga-Schwinger equation. The functional 
derivative 6¥/da(x) and the functional ¥ are invariant. From this it 
follows that the operator W has to be relativistically invariant. 

For W=0, the functional ¥ is constant. This means that we have 
chosen the so-called concept of interaction representation when free, 
noninteracting fields are described by the constant in the time functional 
Yo. 

We now take the second variation of the functional ¥ and vary the 
same space-like surface o (x) in the neighborhood of some other point y. 
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From (28.7) we obtain 


ih Ai W (x) ee W (x) W(y) Y (28.8) 
ih ——— = W (x) —— = = W(x f ; 
5a (x) 60 (y) da(y) if í 

Performing the same variation in the inverse order and subtracting the 
result gives 


[W(x),W(y]=0 for x~y, (28.9) 


i.e., the operators of the energy of interaction of the fields, taken at 
various points x and y, must commute if the points x and y lie on a 
spatial surface so that they may not be joined by a light signal. 
Because the operator W is a function of the operators of the quan- 
tized fields A(x), B(x) the condition (28.9) will be automatically fulfilled 
when the conditions of local commutativity of fields are satisfied, i.e., the 
conditions of microcausality that were described in Section 25. Thus 
the condition (28.9) may be considered as an important result of the 
principle of microcausality. It was obtained for the first time in a slightly 
different form by Bloch [55]. 

In concluding we remark on one other important result of the caus- 
ality principle. This is that the change of a wave function ¥ after a finite 
time, for a transition from spatial surface o,(x) to o,(x) may be re- 
presented as a unitary transformation with the operator U (a, 6,) which 
is a sequence of infinitely small unitary transformations. 

In fact, it follows from (28.4) and (28.7) that? 


P = Y {o, + da(x)} = (1 — : W (x) da (x) ax}  {a(x)}. 


(28.10) 
By using this operator at all points on the surface o,(x) we obtain a 
value for the functional ¥” on the surface oj (x) which is later, but 
infinitely close to the surface ø, (x). From (28.8) we see that the order 
of the operators (1—(i/h) W(x)6,(x)d3x) is unimportant. In this 
manner we obtain the equation 


a1 
Y" = Y {ot (x)} = |: -> | W (x) ax P {o (x)}. (28.10) 


t Here we drop the assumption that ¥ depends on the dynamic variables 4 (x), B(x). 
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In the same manner we can go from the surface ø; to the surface 
m 


o{' which lies later than the surface of but infinitely close to it. Repeating 
this operation an infinite number of times gives 


Y {02} = U (02, 01) Y {01}, (28.11) 


where the unitary operator U(c,, c,) is equal to 
. E 
0 (a,,0,) = resp (—7 wow ax). (28.12) 


where the letter T denotes an ordering in time. The unitarity of the oper- 
ator U(o, c4) follows directly from the fact that the operator W is Her- 
mitian. We may consider Equation (28.12) for the operator U(a,, o,) as 
the microcausality principle in terms of wave functionals. 

From (28.7), (28.11) and (28.12) it follows that the operator U (03, o,) 
satisfies the Tomonaga-Schwinger equation. We write it in the special 
case when c; = — œ, o, =<. It is 


 6U (c, —0) | z 
ih —————— = W (x) U(a, — œ). (28.13) 
ôo 
In the special case for a plane spatial surface o =nx, where n is a 
unitary time-like vector (n? = 1, ng > 0), the variation 6a reduces to a small 
displacement dø on the whole surface ø in the direction of the vector n, 
so that da =dc d?x. Multiplying Eq. (28.13) by d?x and integrating over 
the three-dimensional volume gives 
dU (o,, — œ) 


ih ——-—_—- = W (a) Û (01, — ©), (28.14) 
do 


where 


W (0) = | W (x) d?x = | W (x) (øo — nx) d4x. (28.15) 


In this form the Tomonaga-Schwinger equation resembles the usual 
Schrédinger equation in the interaction representation.* 


* The relativistic nature of the interaction representation and the equation in the form 
(28 14) were first presented ın [56]. 
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We now return to the relationship between the operator U and the 
scattering matrix $. By definition S transforms the microsystem state, 
specified in the infinitely distant past ¥ (— œ) into the microsystem state 
in the infinitely distant future ¥ (+ 00): 


Y (+0) =S¥(—o). (28.16) 


From (28.11) it follows that ¥(— oo) is a limiting value of ¥ {o,} when 
the surface c; is moved to — œ, and ¥ (+ 0) is a limiting value of ¥ {03} 
when the surface a, is moved to + 0o.t Therefore § is a unitary operator 


$$* =f, (28.17) 
and it is the limiting value of the operator U{o,, c} for ¢,>+0, 


O17 -O: 


S$ = Ô {63,6} = rex(-1[ We) atx), (28.18) 


where the integration in the exponent is taken over all space-time R, (x). 
The matrix S$ is often replaced by the scattering amplitude matrix 
T which is related to S by 


S=f+iT. (28.19) 


The unitary relationship for the matrix 7, which follows from (28.17), has 
the form Ë 


i(ît-P=Pf*. (28.20) 


Denoting the particular initial by ¥;(¥;=¥(— œ)) and the final state 
by ¥,, we obtain the probability amplitude for the transition i>/: 


Sp = <P ASD, (28.21) 


and for i#f, the probability amplitude for the transition from state i to 
state f: 


Pre = (CP iT (PDP. (28.22) 
t In Section 32 we shall similarly consider the physical meaning of the limiting transition 


g7+0. 
t The optical theorem introduced in Section 23 [see (XII.4)] results from this relationship. 
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29. MICROCAUSALITY IN THE THEORY OF 
SCATTERING MATRICES 


Stuekelberg and his collegues [53] were the first to attempt to develop 
a theory into which -matrices could be introduced directly without 
using Schrédinger’s equation (28.7) which, in contrast to S-matrices, 
relates states which are infinitely close in space and time. We noted the 
principal importance of formulating the causality principle directly for 
S-matrices. The main result of this line of investigation was obtained 
later by Bogolyubov who introduced a somewhat formal, though very 
effective method for local adiabatic “switching on” and “switching off” 
interactions [38]. 

The scattering matrix S$ is a functional of the interaction energy 
operator. We introduce a second auxiliary operator 


W, (x) = g(x) W(x), (29.1) 
where the function g(x) can have any value in the interval from 0 to 1. 
If we construct a scattering matrix using the interaction W, (x), it will 


be a functional of g(x) whose magnitude defines the degree to which the 
interaction is “switched on” in a given region of space-time 


$= S{g(x)}. (29.2) 


For g(0)=0 the interaction is completely “switched off” and ${g(x)}=1. 
For g(x)=1, the interaction is completely “switched on” and S{g(x)} 
= 5S, the full scattering matrix (28.18). 

We now consider the special case when the switch on function g(x) 
differs from zero in two regions in space-time 
R, (x) and R, (x). The region R, (x) lies entirely 
after some space-like surface a(x), and the region 
R, (x) lies before it (see Figure 31). It is obvious that 


g(x) = 91 (x) + g2 (x), (29.3) 


where g, £0 in R, (x) and g, 40 in R, (x). The wave functional ¥ {o} on 
the surface ¢ may be obtained from the initial state ¥(— œ) using the 
S-matrix which depends only on the interactions in region R; (x). Inter- 
actions in the future, in the region R, (x) do not necessarily influence the 
state 


Y {o} = $ {g (x)} # (— œ). (29.4) 
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Fig. 31. Two regions ®, (x) and R, (x), the second of which lies entirely after (above) the 

space-like surface ø (x). In the region R, (x), the interaction is switched on with a force that 

is defined by the “charge” g, (x). In the region R, (x), the same interaction is defined by the 
charge g(x). 


The final state ¥ {g} is obtained from ¥ {ø} by using the operator 
S{g2(x)}? 


Y {9 (x)} = $ {g2(x)} $ {g, (x)} Y (= 0), (29.5) 
from which it follows that 
$ {g(x} = $ {g2 (x)} $ {g1 09}. (29.6) 


We now compare two cases that differ from each other only by the inter- 
actions in the region R, (x) so that 


g (x) = g(x) + (x) and g"(x)= g3(x) + gi(x). (29.7) 
We form the expression $(g") S* (g’). According to (29.6) we have 
$* (g1 + 92) = 8* (91) $* @), (29.8) 


t Note that because the interaction is switched off, on the surface ø the functional ¥ {o} = 
=Pl+oc, g(x}. 
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and, therefore, 

$ (g) S* (g’) = (g3) S* (95), (29.9) 
i.e., this expression does not depend on the past history of the system in 
the region 9R; (x). 

It is convenient to write this result in differential form. We consider 
that g5(y) differ from each other by an infinitely small amount so that 
69(¥)=92(¥)—92(y), and 6g(v) 40 in R, (y), i.e., in the region that lies 
after ø. In this case 





3 K 5S 

S(g") = S(g’) + | bg dg’ (y) dy. (29.10) 
Omitting the primes in (29.9) we may write this equation in the form 

S(g + ôg) S* (g) = 1 + (g) S* (9). (29.11) 


This expression need not depend on the behavior of g in the region R, (x). 
Therefore, the variational derivative of Equation (29.11) with respect to 
g(x) [the point xe, (x)] must be equal to zero. Therefore, 


ô (fg), ) 
—— sS =0, 29.12 
5900) (5 ore re 


if the point x precedes the point y in time (y>x). Equation (29.12) is 
relativistically covariant. In the case of a space-like interval between 
the points x and y (x~y), it is always possible to make y “later” than x 
by choice of coordinates. Therefore, (29.12) is satisfied even in the case 
when the points x and y are separated by a space-like interval. Equation 
(29.12) is thus an expression of the causality principle in terms of the 
scattering matrix. 

As was shown earlier in [38], on the basis of the requirements: (i) of 
relativistic invariance, (ii) unitarity and (iii) causality [in the form (29.12)], 
it is possible to construct an -matrix without using Schrédinger’s equa- 
tion. We refer the reader to reference [38] for the necessary proofs and 
restrict ourselves here to constructing this matrix to an accuracy of the 
second approximation of the interaction constant. Thus the following 
computations serve only as an illustration of the ideas presented above. 





t There are further mathematical requirements regarding the operators which enter into 
the S-matrix structure (see [38]) which must be considered. 
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We write the matrix $ in the form of a series of functionals in powers 
of g(x): 


1 ia 
S=1+ I [sense d*x, 


I fick 
+ 71 f Sy (x1, X2) g (x1) g (x2) d*x, dx, +: (29.13) 


Since we are considering that for a completely switched-on interaction 
g(x) =1, we assume that the operators S,, S5,..., S, are proportional to 
the powers £!, e?,..., €" of some small parameter which has the meaning 
of a coupling constant (“‘charge”).' 

Placing (29.13) for S in the unitary condition (28.7) we obtain, to an 
accuracy of é”, 

5, (x) + Sf (x) =0, (29.14) 
$a (x, y) + SZ (x, y) + $i (x) Sf) + S, (y) Sf) = 0 (29.15) 
etc.* 

From the first equation it follows that 


$ (x) = if (x), (29.16) 
where L(x) is some Hermitian operator 
L(x) = Ẹ (x). (29.17) 


Condition (29.15) is automatically fulfilled when (29.16) and the 
causality condition (29.12) are satisfied. Inserting the series (29.13) in 
(29.12) we compute the causality condition (to an accuracy of e°): 


$ (x, y) + Sy (x) Sf (y) = 0 (29.18) 


for y>x. 
Taking into account (29.16) we obtain S$, (x, y): 


5, (x, y)=—L(x) L(y) for y>x. (29.19) 


t For example, in electrodynamics ¢=e/(hc)'/?. 
+ Here we introduced the new notation x, =x, X, = y, which is more convenient as we limit 
ourselves to the second approximation. 
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From the equality in rights of the points x and y we have 

8,(y x) = Si (x, y) = — Ê) L(x) for y<x. (29.19) 
Both of these expressions overlap in the regions x ~ y and are comparable 
if 

[£(x), L(y] =0 for x~y. (29.20) 
This requirement is called the locality requirement. 


The equations (29.14) and (29.19) may be combined into one if the 
sign of the T-product is satisfied by 


$ (x, y) = PT [E(x Lo]. (29.21) 


Inserting the expression we obtained for $, (x) and §, (x, y) in (29.13) we 
find 


Sq@=1 +f [Eevee d*x 


i? 


tz 


T fft% L(y) g (x) g (y) d4x ayl +o. 


(29.22) 


Continuing this process, we may obtain the whole series (29.13) which 
may be summed into the symbolic exponential 


Sig) =T fei ft% g(x) ax}. (29.23) 


For g(x)=1, this equation agrees with the equation for the matrix 
(28.18), which was obtained from Schr6édinger’s equation for a wave 
functional, if in (29.23) we set 


a 


L(x) =- : W (x). (29.24) 


Therefore, the locality requirement (29.20) agrees with the conditions 
obtained earlier by Bloch (28.9). 

From (29.23) it follows that the requirement of relativistic invariance 
of the -matrix will be satisfied if, for the inhomogeneous Lorentz 
transformation, 


x, = Lyx, + a, (29.25) 
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(symbolically x’=Lx-+a), the switch on function g(x) is scalar and the 
energy of interaction operator W(x) is relativistically invariant. This 
means that, under this transformation, the following equation holds: 


g(x) > g (x) =g (L7 (x' — a), (29.26) 


or in other words, a new function g’ has the same value at a given physical 
point in space-time as the untransformed function g. 

Furthermore, the energy of interaction operator W(x) is a function 
of the operators of the fields Â (x), B(x) which transforms under Lorentz 
transformations according to the equation 


A (x') = SLŁÂ[L! (x — a)], (29.27) 


The form of the matrix S, depends on the covariant nature of the field 
(for a scalar field, S} = 1 ; for a vector field, it coincides with the matrix L 
so that S =L etc.). 

Because the operator A’ (x') gives exactly the same physical information 
as the untransformed operator A(x) according to the general principle 
of quantum theory, the operators must be related by the unitary trans- 
formation 


A'(x') = U,A(x) UL}, (29.28) 


where the unit operator U, is completely defined by the Lorentz trans- 
formation (29.25). 

If the energy operator Wis invariant, then the transformation will have 
the form 


W’ = W {ÂX} = W{A(x)} = W {ULA Uz} 
= U,W{A(x)} Up! = WLA} = Wr). (29.29) 


Because the S-matrix is a functional (29.23) of the operator W, it ensures 
that the matrix S$ is relativistically invariant. 

In concluding this section we note that the existence of singular 
functions like D(x), S(x) which enter into the commutation law as a com- 
ponent of quantized fields, and of the functions D.(x), S. (x) which enter 
into the T-product of quantized fields leads to the fact that the elements of 
the scattering matrix $ are divergent. In order to obtain physical sense 
from the results in the functional series (29.13), we must make certain 
changes that will exclude contributions from extremely small separations. 


CAUSALITY IN QUANTUM THEORY 159 


These “changes” are called the laws of renormalization. These laws 
are also discussed in existing literature (see [37, 38]) and we shall not 
repeat them in this monograph. We shall limit ourselves to mentioning 
that these laws which are brought in from the outside do not violate the 
basic requirements of relativistic covariance, unitary character and caus- 
ality when applied to the $-matrix. However, one cannot get rid of the 
impression that these renormalization rules support modern theory at the 
expense of elegance and logical harmony of the theory. 


30. CAUSALITY AND THE ANALYTICAL PROPERTIES 
OF THE SCATTERING MATRIX 


An important consequence of microcausality is the definition of the ana- 
lytical properties of the matrix elements of the scattering matrix T, (the 
scattering amplitude) which define the probability of quantum tran- 
sitions from some initial state i to a final state f. 

These properties were first discussed in classical electrodynamics by 
Kronig [57] and Kramers [58] who proceeded from the requirement that 
the propagation speed of a signal cannot exceed the speed of light in a 
vacuum and thus obtained the integral relationship between the real 
part of the refractive index Re n(q) (w is the oscillation frequency) and 
its imaginary part Im n (œ): 


Imn(a’) 301 
oo? — a) we 


00 
2w? 
Ren (w) = Ren (0) — — 
ys 
0 
Relationships of this type are called dispersion relations. The basis of 
these dispersion relations is Cauchy’s theorem. Their relationship with 
causality may be illustrated in the following simple example. 
Let a microparticle which scatters an incident wave be located at a 
point x=0. We write the incident wave in the form 
+00 


i (: E =) = a f exp[— iz(t — x/z)] a (30.2) 


2ri z — (@ — ie) 


— oo 





where e>0. 
We transpose the integration in (30.2) into the complex plane z =w +i! 
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and for t—x/c>0 we close the integration path by an infinitely large 
semicircle R=|z|>0o in the lower half-plane [<0 (see Figure 32). 
There is a “cutting” factor exp[I’(t—x/c)] in the integral so that for 
R- oo, the contribution from the integral taken over the semicircle is 
equal to zero. For t—x/c<0, the contour is closed in the upper half- 
plane, and the “cutting” factor exp[—I'(t—x/c)] for Rœ makes the 


r 





Fig. 32. Integration contours in the complex plane w for t >0 and 1<0. 


contribution from this semicircle also zero. Thus in the first case the 
integral (30.2) is equal to the contribution from the point z = œo — ie, and 
in the second case, because of the absence of singularities in the upper 
halfplane, it is equal to zero. 


Therefore, for e—0, 
Aexp| ~ ioo(e-*)| when t-Ž>0, 
c c 


vo(t-*)= 
c x 
0, when t—- <0. 


c 
(30.3) 

Let us now assume that T (œ) is an element of the scattering matrix 
that transforms a plane incident wave exp[— œ(t— x/c)] into a scattered 
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wave. We restrict ourselves here to a one-dimensional case. Then the 
scattered wave has the form 
+o 
A „ exp[-— iz (t — x/c)] 
y(t, x) == | T(z) f - 
2ri z' — (wo + ie) 


— oe 
where e—0. From the causality principle it follows that the scattered 
wave W(t, x) cannot arise earlier than the time when the incident wave 
Wo(t—x/c) arrives at the scattering particle. In other words, 


w(t,0)=0 for t<0. (30.5) 


, (30.4) 





This requirement will obviously not be fulfilled for all T (œ). 

We shall consider T (œ) as a function of the complex variablez=w+il’. 
Let T(z) vanish on the circle R=|z|—00 and have singularities only on 
the real axis. Restricting the behavior of the amplitude T(z) in the 
complex plane z=w-+il allows us to write (30.4) for the point x=0 in 
the form 


—izt 


w(t, 0) = Fis | T (z) (30.4’) 
2n 


i z — (wo — i£) 
(here e—>0). 

Because of our assumptions, the function T(z) is regular on the 
infinitely large semicircle R->0o and it does not have any singularities 
inside of it. Therefore, we may apply to (30.4’) the same method of in- 
tegration that we used to compute the integral (30.2). Bypassing any 
singularities of T(z) on the real axis from above (z2=@+ ie, e>0) and 
closing the integration contour for t>0 by a semicircle in the lower half- 
plane, we obtain a value for w(t, 0) which is in general different from zero. 
For t<0, the contour is closed from above and because of the absence of 
singularities, y(t, 0)=0. 

Thus if the amplitude T(w) has the analytical properties discussed 
above, then the scattered wave (30.4’) will satisfy the causality condition 
(30.5). Applying Cauchy’s theorem to the amplitude T (œ) gives 


Ta) sb O%. (30.6) 





Z—QW 
c 


We take for the integration contour the line z=w-—ie, i.e., a small 
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semicircle with radius p, which goes around the point z=, and a large 
semicircle with R=|z|0o which closes from below. Remembering that 
T=0 on the large semicircle, we integrate along the infinitely small 
semicircle around the point z= and, by separating the real and imag- 


inary parts, we obtain 
+æ 


Re T (%) =: f ee (30.7) 





where the sign $ denotes the principal value of the integral around the 
point w’=a@+p, p—>0. Equation (30.7) is the simple dispersion relation 
for the scattering amplitude T (a). 

We now assume that the amplitude T(z) has singularities other than 
those which lie on the real axis. We first consider the case when the 
amplitude T(z) has an essential singularity at |z| 0. We set 


T(= T(J e, (30.8) 
where t=a/c, a being some length, and the function T(z) has the pro- 
perties discussed earlier. Then 


œ 
T (2) eT iztts 


A 
w(t, 0) = — | dz. (30.9) 
2ni 


Z — (wọ — i£) 





— PEF 
Using the method described above to evaluate this integral, we find that 
y(t, 0) is not equal to zero for t<—t, i.e., the scattered wave arises 
before the incident wave reaches the microparticle. Such behavior 
violates causality and it may be interpreted as the presence in the micro- 
particle of some linear dimension a (‘an elementary length”) along which 
a signal inside the particle propagates infinitely fast. Thus a violation 
of the correct behavior of the amplitude T(z) for |z| 00 leads to a 
violation of causality. 

Let us now assume that the amplitude T (z) has a pole on the imaginary 
axis z= +iA. It is easy to see that in this case causality will be violated 
so that now, because of the presence of the pole z =iA (A>0) in the up- 
per half-plane, the amplitude of the scattered wave w(t, 0) for ¢<0 will 
be equal to 
Res T (iA) ' 


+At (30.10) 
iA — Wo 


’ 


W(t,0)= A 
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This violates causality during the time |t|~1/A. For t>0, an additional 
term with the factor e7“ appears which is attenuated for ¢>1/A. 

Equation (30.7) shows that the dispersion relation relates the real 
part of the scattering amplitude to the imaginary part. In some cases the 
integral in (30.7) may be expressed in terms of observable quantities. 
Therefore, an experimental examination of the dispersion relation may 
serve as an investigation of the basic assumptions of the modern local 
theory of fields and, in particular, the microcausality principle which is 
important to the entire theory. 

However, the properties of the elements of the scattering matrix 
Sp; which result from the general principles of the local theory of fields 
are difficult to determine, and as yet this problem has not been solved. 
Significant progress has thus far been made only in the simple case of 
elastic scattering, by Goldberger [59], Symanzik [60], Bogolyubov [61], 
et al. In particular for the case of forward elastic scattering (for the angle 
6° =0) of m-mesons on nucleons, the dispersion relation does not con- 
tain any unobservable quantities. 

The most mathematically rigorous proof of these relations was given 
by Bogolyubov and we refer the reader to reference [61] for details. Here 
we shall give only an outline of the dispersion relation theory insofar as it 
is necessary for our subject. 

A basic requirement for constructing dispersion relations for nN 
scattering is the relativistic generalization of the theorems concerning 
the analyticity of Fourier transformations of functions which vanish 
for t<0. 

Let F(x) be a function of x(x, t) which vanishes outside the absolute 
future light cone, i.e., for x <0. Its Fourier transform F(k) is given by 


F(k) = | F (x) exp (ikx) d#x. (30.11) 


Remembering that kx = wt — kx, for the complex value of k, for k>k + ig, 
@—>w-+il, the exponent in (30.11) takes on the form exp[—i(kx — wt) + 
+(qx—TIt). In the region where F has a significant value, where 


r-q@>0, T>0, (30.12) 


we may select a coordinate system so that g=0. Then the factor e'* in 
(30.11) acquires a cut-off factor e~™. Because of this factor the integral 
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(30.11) and its derivative with respect to k to any order will converge. 
This shows the Fourier transform F(k) is an analytical function of the 
variable k in the region (30.12). We may also show that for |k|— 00, F(k) 
increases to a value that is less than some order of |k|" (see [61]). 

The dispersion relations are determined for the matrix elements 7;; 
of the matrix Î (see Section 24). For elastic scattering of z-mesons on 
nucleons, wave functions of the initial state (i) and final state (f) may 
be written in the form described in Section 24 [see (24.14) and (24.18)]: 

ei: . 
y, = u,e'P™, (30.13) 


i 
w 





a e7 ik’y2 E 
P* = ity @7 Px, (30.14) 


w i 


where p=(p, po =(M? +p?)"?) (M is the nucleon mass), k=(k, kj =a 
=(m? +k?)*/?) (m is the meson mass). In the final state the corresponding 
quantities are denoted by primes. We did not include the spinor indices 
of the nucleon nor the isotopic spin indices of the meson so that the equa- 
tions would not be too cumbersome. 

The amplitude T,; is the matrix element of the four branch s(x2,y2 | 
Xi» Yı), taken for the state (30.13) and (30.14). By introducing the coor- 
dinates Y=4(y,+y2), Y=)2—), and X¥=3(x, +x), X=xX2—%, it is 
easy to show that the matrix element of interest, 7,;, may be written 
in the form 





Ty; = (p’, k'| T |p, k) 
2 k+k’ 
== tlk + p’ —k — p) T, = P, p). (30.15) 
4ww' 2 


In the laboratory coordinate system for forward scattering [where 
p'=p=(0, 0, 0, M) and k'=k=(k, œ)] we have 


2 
Ta = : 





5(k’ — k) T.(k). (30.16) 


4ow 


The subscript c on the function T,(k) in (30.15) and (30.16) is used to 
stress that the function T,(k) is the Fourier transform of some function 
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T .(k) which is analogous in its space-time behavior to the causal function 
D,(x)*: 


T.(k) = | T(x) el dtx. (30.17) 


It is, however, more convenient to investigate the analytical properties 
of quantities which have retarded or advanced characteristics. 

It is possible to show that a retarded function T,,,(k) may be obtained 
which is analogous to the retarded function D,,,(k) whose real part, 
i.e. p> =M? and k? =m’, is identical to the function T,,(k) on the energy 
surface p+k=p'+k’' and 


T, (k) = Trey (k). (30.18) 


Together with T,«(k), we also introduce the advanced function T a (K) 
which is analogous to D,,4,(k) and also their difference 


T(k) = Tiea (k) — Taav(k). (30.19) 

The functions Te (k) and T,4, (A) satisfy some of the symmetry conditions 
for the inverse vector k, namely, 

Tha (k) = Tre (— k), (30.20) 

Tre (k) = Thay (— K). (30.20’) 

We will also need to have the amplitude 7(k) written as a sum of its 

dispersion (Hermitian) part D and absorption (anti-Hermitian) part iA: 

T (k) = D(k) + iA(k). (30.21) 

In the case of forward scattering in which the nucleon spin and 

isotopic meson spin are unchanged (i.e., coherent scattering), the quan- 

tities D (w) and A (œw) are real because the matrices D and A are Hermitian. 

We note one further detail. Because of the conservation laws, the 

vector k in the exponent of (30.17) contains irrationalities which introduce 

difficulties. In fact in the scalar product kx=a@t—kx the absolute 


magnitude of the vector k depends irrationally on w:|k|=(@?—m/?)1/? 
=). Therefore, by writing the scalar product in the form 


wt — kx = wt — esx, (30.22) 


t Note that in the second order of perturbation theory T’,(x) =D,(x). 
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where e=k/|k|, we can remove the irrationality in œ in (30.17) if, instead 
of (30.17), we consider functions that are symmetric or antisymmetric and 
that are written in the following form 


T.(e, 4) = T (e, 4) + T(-e, 4), (30.23) 
T,(e, 4) = Al [T (e, 4) — T(- e, 4], (30.23") 


so that they depend only on 4°. From (30.23) and (30.23 it follows that 
T (e, 4) = $[T;(e, 2) + AT (e, 4)]. (30.24) 


When the functions are written in this symmetric form, the absorbing 
part of the amplitude A has a certain symmetry with respect to sign 
changes, namely, 


A,(— œ) =+ A (o), (30.25) 
A,(— œ) =— A,(@). (30.25’) 


We shall use these equations later. 

We shall conclude our studies of analytical properties by considering 
the functions T,,,(w) and T,4,(@) written in the symmetric form (30.23)- 
(30.23’). A very complicated and laborious study leads to the result that 
the functions T,.,(@) and T ay (œ) represent one analytic function T (w) 
in the complex plane œ, which is identical to T,,,(@) in the region Im @>0 
and is identical to T,,4,(@) in the region Im œ <0. T(@) has branches on the 
real axis in the region — œ <w< —m and œ >> —m and has poles at 


m2 


o= ryt ee: (30.26) 
It vanishes on the circle |œw| = R> œ. 

Figure 33 shows the complex plane œw. The heavy line denotes the 
contour C within which T (œ) is an analytic function of œ. If the amplitude 
T(w) increases as w?, then it can be replaced by the quantity T’(w) 
= T(w)/(@—@ )"* which differs from T (œ) in that it has an additional 
pole at the point w=% and it tends to zero in the region |w|— œ. 

Cauchy’s theorem may be applied to T’ (œw) by choosing the integration 
contour as shown in Figure 33. Taking the limit Roo, we obtain the 
dispersion relation for T’(q@) in the form 
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Fig. 33. Integration contour for the amplitude T(q) in the complex plane w. This figure 

shows the branch cuts from — R to —m and from +m to + R (R> œ), and also the simple 

poles at w= + wg. The (n + 1)th order pole at the point wọ is produced by dividing the am- 
plitude by (w — wọ)" +*+. 


T (œ) 1 T T (œ) da’ 
( 


(@ — wo)"t!  2ri w' — Wo)"*! (w — w) 








—o tie 


oo — ie 


+ | T SY 30.27 
oe wa O 





We denote the discontinuity in T(w) on the branch cuts (— œ, — m) and 
(m, œ) by AT() and, multiplying (30.27) by (w—a@ )"*! we obtain 


Ss n+1 , 
T (œ) = a 2o f ( ane) da! 





w — t! (a — a) 








a (œ — wo)" t! | AT (a’) da’ 
2ni (w’ — wot! (w — œ) 


pA 2 
ef ae oh aa ` cor. (30.28) 
g 
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where g?/(w+,) are residues at the points w= + œ, and the constant co- 
efficients C, are calculated at the point w = wọ. 

The jump 47(q) at the branch cuts of T(@) is determined by con- 
sidering that the absorbing part of A, which differs from zero for œ < — m 
and w> +m, changes its sign in going from Im œw >0 to Im w <0 while the 
dispersion part of the amplitude D(w) changes continuously. Thus the 
jump 4 T (œ) is equal to 


AT (œ) = 2iA(a). (30.29) 


We may now make use of the symmetry in (30.25) and (30.25’) and replace 
the integration over negative œ with integration over positive œ. Then 
we obtain 


æ 








_2@- @o)"*! A (œ) da’ 
T 1 | (w — wt! (w? — w?) 
g g 5 ; 
+ TA + ©- + Cc," . (30.30) 
r=0 


The choice of index n remains arbitrary, but its smallest possible value 
must be determined indirectly. According to the optical theorem (XII.4) 


k 
A(o) = nm (œ), (30.31) 


where k is the meson momentum in the center of mass system and o7(@) 
is the total cross section for the process n ++ N—>N + ---. Experiments show 
that o;(w) tends to a constant value or even decreases with increase in 
w. We may, therefore, consider that for high energies k + w, the absorbing 
part of the amplitude A (w) does not increase faster than w— œ, that is for 
n<1. Since |D/? enters into o;7(@), the dispersion part of the amplitude 
D must decrease as w increases. These considerations allow us to limit 
ourselves to multiplying T(@) by (w—w@,)~? when constructing the 
dispersion relation for xN-scattering. 

In order now to obtain from (30.30) a specific dispersion relation for 
scattering of neutral n°- and charged x*-mesons, we must determine the 
dependence of the scattering amplitude on spin and isotopic indices. We 
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shall, however, give only the results of these studies here. We denote the 
dispersing and absorbing parts of the amplitude for neutral (0) and 
charged (+) mesons by Do, Ds, Ao, A+ respectively. Then the dispersion 
relations may be written as 

2 2 2 
g (@* — a) 
Do(@) — Do (0) = — w? 

ol ) o( o) M * (w? — w) (@3 — w2) 


to 5 22 att ee eae A 
0 





29? a(o? — o3) 
M (@? — @?2) (@ — @) 


Æ 2 (wo? — od | A, (w’) das 
T ( 


w? M w?) (w’? = w$) 





D, (œw) — D, (wo) = 








(30.32) 
w 29? ww? (w? — w) 
D: (w) — Dd Dz (w0) = M (@ — o2) (0 — o) 
2 2_ 22 A,(o') , 
tze 0 | BO te 
(30.32”) 
where 
D, (©) = D, (@) + D_(a), (30.33) 
D- (w) = D, (w) — D- (a), (30.33) 
A: (w) = A, (w) + A- (Œ), (30.33”) 
A: (0) = A, (w) — A_(@). (30.33””) 


The two arbitrary constants Cy and C; in the term )"4C,o’ are eliminated 
by subtracting the amplitude D (wo) and (@/w,)D(@ ) taken at the point 
w= wo from the dispersion relation. 

The absorbing parts A44(q@), Ao(w) may be expressed in terms of 
experimentally observed total cross-sections using the optical theorem 
(30.31). Finally the constant g may be defined from low-energy inter- 
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actions. In this manner the dispersion relations (30.32)-(30.32”) may be 
checked experimentally and, therefore, the validity of the local theory 
of fields can be proved or disproved. 

Another important result of the local theory which is also based on the 
analytical properties of the scattering amplitude, relates to the asymptotic 
behavior of the total cross section for particles and antiparticles. On the 
basis of the dispersion relations for forward elastic scattering of z-mesons 
off nucleons and using the optical theorem (see Section 23), Pomeranchuk 
[62] showed that the total cross sections for the interaction of x*- and 
m~-mesons and nucleons for very high energies (w— œ) have to be equal 
to 


OntN = On-N for w@->0. (30.34) 


Recently another theorem has been used to determine the asymptotic 
relationship between the cross sections for various processes. This is the 
theorem concerning the behavior of an analytic function f(x) inside an 
angle that is formed by rays, as a function of the limits C, and C, to which 
it tends along those rays [63]. The theorem states that if the function 
f(z) which is analytical inside the angle C,OC, had different limits, 
C,#C>, along the rays OC, and OC, then it would increase inside the 
angle faster than an exponential. 

Returning to our problem, we shall consider the plane z as a complex 
plane of the meson energy w:z=w. If we now take the angle x for the 
angle C,OC, (upper half-plane), and take the elastic scattering amplitude 
of a particle a being scattered off a particle b, extended in the upper half- 
plane, for the function f(z), then for z=@-—0o the function f(z) will 
correspond to the forward elastic scattering amplitude T (œ) for the pro- 
cess 

a+b>a+b; (30.35) 


For w> œ and z=w— — œ this function will correspond to the same 
amplitude T (w) for “cross-over” processes? : 


ã+b>ã+b, (30.35) 


where à denotes the antiparticle of particle a. 
Because the forward elastic scattering cross section is analytic in the 


t For details of “cross-over” reactions, see Section 31. 
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upper half-plane and its growth is limited to a polynomial (~ œ"), by 
virtue of the theorem stated above, the extreme values c, and c, of the 
amplitude of f(z), is its values at w= +00 must be equal. Because the 
imaginary parts of the amplitudes are equal, from the optical theorem 
it follows that the total cross section has to be equal. In particular, if 
particle a is a x-meson and particle b is a nucleon, we obtain (30.34). 

Using this same theorem, asymptotic relationships were obtained for 
many different reactions as well as the total xN scattering cross section 
and for differential as well as total cross-sections [64]. 

We shall quote some of these relationships. 


(a) For total cross sections with @—oo, identical cross sections are 
obtained for the following reactions 


nm +pr and x +p—>--, 
Kt +p. and K” +p, 

BAe. UN te (30.35) 
Foe pael and Y= pepsi, 


where the dots on the right indicate all possible products of the 
reaction on the left. 

(b) For differential cross sections with woo and with a specific 
momentum transfer, the following reactions have the same cross 


sections 
nm +pon*t+p and n +pon +p, 
K*+p—->Kt+p and K~ +p—>K~™ +p, 
nm +p>K*+y* and K` +p>r +}+, 
nm +p>K?+4 and K°+pont +A. 


(30.37) 


In concluding this section, we list the basic principles of local field 
theory which were used in obtaining the dispersion relations for the 
scattering amplitude T and the asymptotic relationship for the cross sec- 
tion ø. 

I. Microcausality must be fulfilled, which indicates the absence of any 

influence of one region of space-time R, (x) on another R4 (x) if the 
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II. 


HI. 
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points which belong to these regions are separated by a spatial 
interval 
X2 ~ Xj, x E R(x), x E R(X) 


and if the interval is time-like, i.e., x? >0 but x, < x; (the cause must 
precede the effect !). 
A spectrum of stable particles with positive energy 


Po = VP +m, (30.38) 


must exist, where m is the mass of the particles (s=1, 2,...). 
The amplitude T(w) for woo must not increase faster than a 
polynomial: 


IT (@)| < ala)", (30.39) 


where n is positive integer. 


CHAPTER VI 


MACROSCOPIC CAUSALITY 


31. FORMAL §-MATRIX THEORY 


We begin this chapter by listing some of the more important properties 
of the S-matrix. As we showed in Section 28, the S-matrix is unitary. It 


transforms an initial free particle state ; given at a time t= — œ into a 
final free particle state ¥,, which is considered at the time t= + œ: 
Sea ae (31.1) 


This matrix is usually written in the form [see (29.4)] 

§$=1+iT. (31.2) 
The unitary condition 

SS*+ =T (31.3) 

may be turned into an identity if we set 

=e", (31.4) 
where 7 is the Hamiltonian operator 

ñ=", (31.5) 


or it may also be written as 





ger Lg (31.6) 
I +iK 
where K is another Hermitian operator 
kokr (31.7) 


The concept of the asymptotic states ¥; and ¥, requires a detailed 
analysis. In particular, we shall now consider the limit t— +00 as 
purely formal and we shall not go into its meaning. We shall return to 
such questions later. 
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The asymptotic state itself is defined by a complete set of dynamic 
variables p which characterize a state of free particles. We choose the 
momentum of these particles for the set of variables so that 

Y, = ¥(p), (31.8) 
¥,= Y; (p), (31.8) 
where (p) is the set of momenta in the initial state, and (p’) is the set of 
momenta in the final state. (p)=(p,, Pp—15---» P1) if the initial state has 
n particles and (p')=(Pm+ns Pm+n-1>--- Pn+1) if the final state has m 


particles. For free particles, the four momentum components have 
specific values, namely, 


Pos = + V P? + m, s=1,2,..,m+n, (31.9) 


where m, is the mass of the sth particle. The set of values of the 
variables p=(po, p) form a three-dimensional manifold ®3(p) which 
corresponds to the surface of a hyperboloid of one sheet 

P-P =m, po > 0. (31.10) 
This manifold forms a Lobachevski space,t a scalar curvature of R 
[compare with (5.13)] is defined in terms of the particle mass 

R =— 1/m’. (31.11) 
Note that the second sheet of the hyperboloid 


Po =- Vym + p? (31.10) 


may be used to represent the antiparticle state. 
Thus a space in which the asymptotic states i and f are defined may be 
considered as composition of several Lobachevski spaces: 


R; (p) = R; (P1) O R3 (P2) ©---@ R3 (Pr). (31.11) 
The space ®,(p’) may be similarly constructed, so that 
R; (p) = R3 (Pp+1) @ Ra (Pp+2) DQ Ra (Prim) (31.11) 


We shall call the space R;(p) the space of initial states and the space 
R,(p’) the space of final states. We shall call both of these states the space 
of asymptotic states. 


* This fact was successively used in references [65] and [66]. 
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Note that the state R(p), by its very meaning, belongs to the set of 
characteristic values of the momentum operator p of free particles and 
the energy operator fo. For each characteristic value of the operator f 
and the sign of the operator fy, there exists a characteristic function 
w; (x). Therefore the set of all the characteristic functions y* (x) of the 
operators p and fy may be associated with each point in space. Each of 
these sets forms a Hilbert space R {y}. In this space, the functions wF (x) 
and their products are vectors or tensors which define a complete system 
of orthonormal functions. 

In the representation of the asymptotic states y, and y; that we 
selected, the rows of the -matrix are characterized by the momenta (p’) 
in the final state f, and the columns by the momenta (p) in the initial 
state i. Therefore, we may write 


Sy: = <P'|S |p 
= (Pm+m Pm+n-1> 229s P| S IPn Pr- 1> +++ Po P1). 
(31.12) 
£ It will be more convenient to separate the unitary matrix Î from the 
S-matrix in (31.3). Then we may rewrite (31.12) in the form 
Sa = CPT p> + i<p'l Î ipò, (31.13) 
where 


DE IP) = lPmtn Base ase Poeal Ê Wid ene P2 P1) 
(31.14) 


The -matrix must be relativistically invariant (compare to Section 
29). Let us transform the momenta (p) using the Lorentz transforma- 
tion matrixt 


p=Lp, (31.15) 
where (p) denotes the set of new, transformed momenta. The vectors of 
the state ¥ (p) undergo unitary transformation using the unitary matrix 
O,. 

P (p) = ÜY (Lo *p) = ULY (p), (31.16) 
where ¥ temporarily denotes the state vector in the new coordinate 


+ The matrix L differ from the matrix L in (29.25). 
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system. Applying this transformation to Equation (31.1) gives 


0, -(L~'p) = 0$ (L7 'p) Ur OLY: (+P), (31.17) 
i.e., 

P, = $) P, (31.18) 
where 

$) = 0.8L" 'p) r+. (31.19) 


Relativistic invariance requires that the same transformation law hold for 
all reference frames and, therefore, the following relationship must hold: 


S(L~'p) = $ (p). (31.20) 


This is the condition of relativistic invariance for the theory, expressed 
in terms of momentum variables. 

In the absence of external fields, because of the uniformity of space- 
time, physical phenomena cannot depend on the position of the origin 
in space or time. Therefore the -matrix must be translationally invariant. 
From this requirement it follows that 


DS — SD =0, (31.21) 
where D is a translation operator, which has the form 
D = exp (iÊ,a,), (31.22) 


Ê, is the operator of the total momentum of the system being consider- 
ed, and a, is the vector of an arbitrary displacement. From (31.21) and 
(31.22) it follows that 


PS — SP, =0, (31.23) 


i.e., the -matrix commutes with the total momentum operator P. There- 
fore the S-matrix must be diagonal with respect to the total momentum 
operator P. Therefore the matrix elements S ri must be proportional to 
6-functions of the complete four-dimensional momentum, i.e., pro- 
portional to the function 


ôt (P — P), 
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where 


=) pP = } pars and P=) p=) p. 
s=1 s=1 


Note that the matrix element of / is equal to 
<P T |p) = & (p' — p) = Ul 5° (Ps — Ps), 


i.e., to the product of three-dimensional 6-functions. For n>2, it is 
always possible to obtain a four-dimensional 6-function of the total 
momentum from this product, so that 


(Pit Ip =S (W —p=H4(L P-L pl lps 81.24 


where I(p’ | p) is expressed in terms of relative momenta. The matrix 
element <p’ |Î|p> must also be proportional to 64(Yp' — Yp), i.e., 


<P'| T |p) =t Ep -ET |p). (31.25) 


Further details of the form of the matrix elements may be obtained by 
considering the fact that in a relativistically invariant theory the multi- 
plication law for any two matrices A and B, taken in the momentum 
representation, must be relativistically invariant. We have 


<p'| AB |p =f <p'| A |p") d°p” <p"| B |p). (31.26) 


However the volume element d?p” is not invariant. We may ensure the 
relativistic invariant character of the matrix multiplication if we rewrite 
the matrix elements in the form 

nf A(p’, p) 
PAN Sse (31.27) 

V2p0---»/2Po 

where po is the fourth component of the momentum and A(p’, p) is an 
invariant function of the momenta (p’) and (p). In fact, in this case in 
place of (31.26) we obtain 
= ees (p’, p”) d°p" Bip", p) 
<p'| AB |p> = s = p} . 
V2 ny2 / 200 © V 2Po 


(31.26’) 
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The volume element d3p”/2p¢ that appears here is a volume element in 
Lobachevski space, which is relativistically invariant. 
Finally, we write the elements of the S-matrix in the form 


Sji = <p'| K Ip» 
T(p’ | p) 


= <p'| 1 |p> + i5* È p — Y} p, (31.28) 


2p)... ./2Po 


where T(p’ |p) is an invariant function of the momenta of the initial and 
final states. 

We shall now consider in detail the structure of the S-matrix in the 
simplest case when the particles are of one type, scalar, uncharged and 
have a rest mass m. We assume that the interaction between them is such 
that it neither leads to the formation of stable groups with a mass M that 
is smaller than the sum of the masses of the original particles, nor to 
unstable groups which have a mass M* that is larger than the sum of the 
masses of the original particles. The possible processes in this simple 
case are limited to scattering, generation and absorption of particles. In 
this case there is only one type of scalar particles, and these are able to be 
scattered, produced and absorbed. For brevity, in Table I the matrix 
rows ( f) are listed according to the number of particles m that are found 
in the final state and the matrix columns are arranged according to the 
number of particles found in the initial state n. A matrix element <m lln), 
which is defined in the asymptotic space R(p, p) =R: (PPOR; (p 
which has 3 (m +n) dimensions is written in each location (m, n) of Table I. 


TABLE I 
The scattering matrix for the simple case of scalar particles 








1 2 3 4 5 6 7 8 
f 

I LI 

2 2,2 2,3 24 2,5 2,6 2,7 2.8 
3 3,2 3,3 3,4 3,5 3,6 3,7 3,8 
4 4,2 4,3 4,4 4,5 4,6 4,7 4,8 
5 5,2 5,3 5,4 5,5 5,6 5,7 5,8 
6 6,2 6,3 6,4 6,5 6,6 6,7 68 
7 72 7,3 7,4 7,5 7,6 7,7 7,8 
8 8,2 8,3 8,4 8,5 8,6 8,7 8,8 





MACROSCOPIC CAUSALITY 179 


The element (1, 1) is equal to one, the diagonal elements (2, 2), (3, 3), 
..., (m, m) describe elastic scattering. (We define elastic scattering as the 
scattering in which particles are neither produced nor absorbed nor do 
they change their inner state.) The elements for which mn describe 
transition which are accompanied by production or absorption of part- 
icles. The empty locations indicate matrix elements which are equal to 
zero. Note that all the elements on a diagonal for which m+n=constant 
are described by the function T (p’, p) (31.28) with a certain fixed number 
of variables 3(m-+n). This fact allows us, at least in some theoretical 
schemes, to consider all the functions 


<m \7| n> = T(P1, Pr aiia Pm) Pi» Pz» -> Pn) 
as one function 
T(p) = T (Pis P25 -+s Pm Pmt is ++ Pm+n)> (31.29) 


in which the properties of the initial and final states are the same. This 
type of symmetry in the initial and final states is called crossing sym- 
metry. In the general case for the reaction in which in the initial state (i) 
there are n particles, a, b, c, d,..., n and in the final state (f) there are m 
particles x, y, Z, u, ..., M 


a+b+c+d+ >x+y+z+u+-> (31.30) 
the function T (p) (31.29) will be the same as that for the reaction 
Atb+ ct + 84+ 24+ 9dt ete tute, (31.30) 


where an arbitrary number of particles is transfered from the left to the 
right of the equation, of vice versa, but the particles are replaced by 
antiparticles (denoted by ~). 

A special case of this situation is shown in Figure 34. The properties 
of cross symmetry allow us to consider the set of spaces R,;(p) and 
R,(p’) as one space 


RK (p) = Ri (P) @ R; p’). (31.31) 


We now turn to the case in which, besides the “original” particles 
with mass m, there also exists a stable particle with mass m<M<2m. 
Such a particle may be considered the result of two particles with mass m 
coalescing; this being accompanied by a mass defect of A=2m—M>0. 
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1 (b) 


Fig. 34. An illustration of cross symmetry. In (a), 1, 2, 3 are particles in the initial state and 


1’, 2’, 3’, 4’ are particles in the final state. In (b), particle 3 1s transferred to the final state, but 
as an antiparticle. The junction s is the same in both diagrams. 


The existence of such particles must be taken into account in the asymp- 
totic states by considering that the corresponding Lobochevski space will 
have the curvature [see (31.11)] R= —1/M?. 

The scattering matrix for this case is shown in Table II. The ordering 
of the rows and columns is defined as before in terms of increasing rest 
mass of the particles that are found in the initial and final states. In 
order to simplify the notation the mass of the light particle m is taken 
to be 1. The existence of mutual transmutation of particles complicates 
the matrix. Apart from the unitary matrix element (1, 1), there is only 
one other element (M, M) which describes the motion of the particle M 
in terms of inertia. The elements (M, 2) and (2, M) are equal to zero 
because, according to the assumption that m< M <2m, the particle M 
cannot decompose into two particles m. Similarly, the process in which 
two particles of mass m coalesce into one M particle is impossible. 
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TABLE H 
The scattering matrix for the case of two types of scalar particles, particles with mass m 
and with mass M, where m< M <2m. The rows and columns are ordered ın increasing 








rest mass. 
i 
1 M 2 M+1 3 2M 4 2M+1 
f 
1 1,1 
M M, M 
2 22 2,M+1 2,3 2,2M 2,4 2,2M+1 
M+1 M+1,M+1 M+1,3 M+1,2M M+1,4 M+i, 
2M+1 
3 3,2 3,M+1 3,3 3,2M 3,4 3,2M+1 
2M 2M, 2 2M,M+} 2M, 3 2M,2M 2M,4 2M, 
2M+1 
4 4,2 4,M+1 4,3 42M 4,4 42 M+1 
2M +1 2M+1,2 2M+1, 2M +1,3 2M+1, 2M+1,4 2M+1, 
M+! 2M 2M+1 





However, such processes are possible if three particles take part, because 
the third particle can carry off the extra energy which is released in this 
process. Therefore, the matrix elements (M +1, 3) and (3, M + 1) are, in 
general, not equal to zero. Amongst the matrix elements are those which 
describe processes that involve only M particles [e.g., the element (2M, 
2M) describes elastic scattering of M particles, the element (3M, 2M) 
describes the formation of M particles during a collision of two M part- 
icles, etc.]. 

We now turn to the case where the M particle has a mass that is 
greater than 2m. There are now two possibilities: (1) the particle M is 
stable and so the processes M2m are forbidden; (ii) the particle M is 
unstable and in this case the processes M2m are allowed. The scattering 
matrix for the case of stable M and m particles which can be produced, 
absorbed or scattered in interactions, is shown in Table III. In contrast 
to the matrix given in Table II, in this case the rows and colums which 
contain M particles follow after rows and columns which contain two m 
particles. For definiteness, we assume that 2m < M <3m and, therefore, 
the third row and third column follow the row M and column M, 
respectively. 

The matrix elements (M, 2) and (2, M) are equal to zero because of 
the assumption of stability of the M particles. This is a special exclusion. 
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TABLE III 
The structure of the scattering matrix for the case of two scalar particles with mass m and 
M, where 2m < M<3m and the particle M is assumed to be stable. 








1 2 M 3 M+i 4 M+2 2M 
f 
l 1,1 
2 2,2 2,3 2,M+1 2,4 2,M+2 2,2M 
M MM 
3 3,2 3,3 3,M+1 3,4 3,M+2 3,2M 
M+1 M+1,2 M+13 M+1,M+1 M+1,4 M+1,M+2 M+1,2M 
4 4,2 4,3 4,M+1 44 4, M+2 4,2M 
M+2 M +2,2 M+2,3 M+2,M+1 M+2,4 M+2,M+2 M+2,2M 
2M 2M, 2 2M, 3 2M,M+1 2M, 4 2M,M+2 2M, 2M 





The processes M = m= 2mare not considered forbidden; in particular, the 
process 2m— M +m may be considered as an inelastic impact which is 
accompanied by the absorption of a light particle m and the formation of 
a heavy M particle. The processes 3m M are forbidden because of the 
condition M <3m and, therefore, the corresponding matrix elements (3, 
M) and (M, 3) are equal to zero. 

At present the existence of a very large number of unstable particles 
is known. Therefore, the case in which the M particle is unstable is very 
important. However, it is impossible to consider this case without a 
space-time description of the phenomenon. Thus we shall postpone 
considering case (ii) and turn to some details related to space-time 
descriptions using the scattering matrix. 


32. SPACE-TIME DESCRIPTIONS USING THE -MATRIX 


By definition the $-matrix transforms an initial state ¥; which is specified 
at ti = — œ, into a final state ¥, which is determined at t,= + œ. In 
Section 28 we showed that [see (28.18)] the S-matrix is a limit of the 
unitary operator U{o,, o} which transforms the state ¥;{0,} which is 
specified on the spatial surface c, into the state ¥, {a} on the surface 
6. In the case when the spatial surfaces are planes, these surfaces 
degenerate into the surfaces c, =t, and c, =¢,, so that 

$= lim Û(t,, tı). (32.1) 


tı >- o 
t22- %0 
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We must now determine more accurately the meaning of the limits 
ti >œ and ft, 00. We shall consider the simple case when there are only 
two particles A and B in the initial state. We shall consider that the initial 
state is specified for t=¢, and is described by the wave packets u, and up, 
respectively. Then 


P, (tı) = ua (t1) Up (ty). (32.2) 


We may assume that these packets were formed by passing through 
apertures s% and Z which were opened for some time close to the moment 
in time t, = — T. We assume that the initial dimensions of the packets are 
a, =a, =a (a, is the dimension perpendicular to the direction of motion 
and ay, is the parallel dimension). Let the distance between the packets at 
t,;=—T be R>a. We shall consider that the particles have sufficiently 
well-defined average momenta P,>Apwxh/a, Pg>Ap~xh/a and the 
corresponding average packet velocities are v,=0EF,/0P, and vg=0E;/ 
OP, where E,=(m?c*+c7p3)!/?, E,=(m?c*+c7p3)'? are the average 
energies. In the center-of-mass system the relative velocity of the packets 
will be v=v,—v,g, and E4 = Ep. In the following we shall omit the sub- 
scripts A and B because of importance is only the relative motion of 
packets. If the interaction radius of the particles is b,* then for b>@-the - 
packets u4, ug will interact for a time T~ R/v. In this case the particles 
may be considered as asymptotically free during the time 


R b 
T=—>-, (32.3) 
v v 


beginning at the time ¢=1,. In the case when b <a, we must remember that 
the accuracy to within which the particle may be localized and, therefore, 
for b<a, in place of (32.3) we have the condition 


R 


a 
Tams (32.3') 
v v 


Because a> h/p =z (4% is the wavelength characteristic of the packets), in 
general R<zZ in the asymptotic region. 
After the interaction for t, = + T, the wave function of the final state 


t Itis well known that the radius of interaction b is not always definable. In particular, in 
the case of Coulomb interaction, it is infinite We here exclude this case from our consider- 
ation. 
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Y ,(t2) may be written in the form 
Y (t2) = ua (t2) ug (t2) + u (t2), (32.4) 


where the function u4 (t2) ug (t2) describes the initial packet in the moment 

t,, and the function u(t) is the asymptotically scattered wave. The 

scattered wave u for large separations r of the particles has the formt 
e e 

ECS AED R (32.5) 


where k=1/% and C, D,... are coefficients that do not depend upon r. 
The asymptotic expression for this wave for t, = T= R/v is 
ikr 


u(R) = C— (32.6) 


and we assume the separation r = R is sufficiently large in comparison to % 
so that it is possible to neglect the second term (De‘*®/R?) and all higher 
terms in (32.5). Therefore, the asymptotic region corresponds to the 
region which is usually called the wave zone (R>A). 

In “the language of time’’, neglecting terms of the order 1/R? implies that, 
in taking the limit t, = T—> + œ, terms of the order 1/7 are kept and terms 
of the order of 1/T? are considered negligibly small. 

Note that there also exists an upper limit for T if we want to conserve 
the space-time description for the collision process. In fact using wave 
packets in this description assumes that the initial packet (specified for t4 
= —T) does not diffuse significantly before the collision, for otherwise 
the moment of collision would be quite undefined. 

According to the relativistic wave-packet theory the packet, which has 
has a width A at the initial moment in time, diffuses in time so that its 
longitudinal width A, (^) is equal to [see (18.10) and (18.10’)] 


$? mt 
A - j” tap" (32.7) 


and the transverse width 4, (t) 


72 
A, = _fa+ a vr. (32.7') 


t In the general case the scattered wave also describes new particle production in 
collisions. For simplicity, we shall limit ourselves here to elastic collisions. 
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Because of the properties of the factor (m/E)* in (32.7), Eq. (32.7’) is more 
critical than (32.7). The limits on the diffusion of the packet during the 
collision process implies that 4, at t=27 does not greatly differ from 
A, at t=0. Therefore, a*>2%?0?T? =%?R?. Combining this inequality 
with (32.3) and (32.3’) gives 

2 


> >R>(b or a)>F. (32.8) 


Figure 35a shows the paths of the particles which pass through the 
apertures ./ and Z. This figure shows the case when the radius of inter- 
action of the particles is greater than the dimensions of the packet a. 
Figure 35b shows the case when b <a so that the region of possible inter- 
action is defined by the packet dimension a. In both cases the packet 
diffusion in a time T is assumed to be small. Figure 35c shows the case 
when the packets diffuse so rapidly that all information on the moment 
of collision is lost. 





(a) (b) 


Fig. 35. Wave packets u, and u, which are emitted from the aperture A and B ın an ab- 

sorbing screen at a time ft, = — T are shown here. Around t=0 a collision occurs; u; and u, 

are the same packets (partially scattered) at t, = + T. b is the radius of interaction, a is the 

aperture width, R= AB=vT is the asymptotic separation of the packets. Figure 35a shows 

the case of b> a, R>b. Figure 35b shows the case b <a, R >a and Figure 35c shows the case 
of very large diffusion of the packets so that no information on causality exists 


These restrictions on the limits |t,|, |t;|-00 apply to all particles. 
Unstable particles have other restrictions which are significant. Let the 
initial beam (¥,) include a particle with a decay period of t;. For |t,|> 
>T; the unstable particle decays and the incident bean changes its char- 
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acter. For |t,|<t, the unstable particle may be considered as stable. 


Therefore, for the limit t; = —T the following restriction holds 
b a 
™>T>- |or>-]. (32.9) 
v v 


A similar restriction holds in the case where unstable particles are present 
in the final state. The instability of the particles may be ignored if 


b 
1, >T>- (or > 5). (329') 
v 


We now assume that in the spectrum of particles there is an unstable 
particle whose mass in M, where 2m < M < 3m so that the particle M may 
decay into two m particles: M—2m. Let the decay period be t. Then there 
are two possible ways of constructing the S-matrix. If the asymptotic state 
is defined for T <t, then the particle M may be considered stable and the 
S-matrix has the form given in Table II. In this case the space R(p) in- 
cludes two types of Lobachevskii space. The first type has a curvature that 
is defined by the mass of the particle m, and the second type has the curva- 
ture defined by the mass of the heavy particle M. 

We may also consider the case when the asymptotic time T>t. In 
general unstable M particles will not appear in the asymptotic case 
because they disintegrate into m particles in the time T>t. In this case 
the asymptotic space consists entirely of Lobachevski spaces that have 
curvatures characteristic of particles with mass m. The scattering matrix 
then has the form given in Table I. 

However the “memory” of the possibility of forming unstable M- 
particles remains and it is expressed in the fact that the matrix element 
(2, 2) which describes elastic scattering of two m-particles m + m—m +m 
has a resonance when the relative energies of the two m-particles is equal 
to the mass of the M particle. By convention, this resonance could be re- 
written as a scattering process, which takes place by an instantaneous 
formation of an M-particle, in the form 


m+m>M>m+m. (32.10) 


The width AE ~h/t of this resonance is defined in terms of the lifetime 
of the M-particle.* 
From these consideration it follows that the character of asymptotic 


+ For further details on describing unstable particles using an S-matrix, see Section 34. 
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space R(p) and the character of the asymptotic states ¥; and ¥ s depend 
essentially on the meaning which is given to the limits tf; — œ and 
t,— +00 in (32.1). 


33. THE SCALE FOR THE ASYMPTOTIC TIME T 


The interactions of particles are usually divided into strong, electro- 
magnetic and weak interactions. These interactions define the time scale 
t which serves as a criterion for selection of the asymptotic time 7. In 
In the case of strong interactions, such as the interactions of baryons with 
themselves, the radius of the interaction region is equal to an order of 
magnitude, to the Compton length of -mesons, i.e., b=h/m,c=1.4 x 
x 107+? cm. 

The time t, during which a group of strongly interaction particles 
disintegrates, is t,=/v where v is the velocity of the particles. For vxc 
this gives t,~5x 10774 sec. In these interactions, excited states of 
baryons, baryon resonances and mesons, are produced. These and other 
particles are unstable. Baryon resonances are described by the equation 


m+N>N*>N+m, (33.1) 


where m is a meson, N a baryon, and N* is an excited baryon. The cor- 
responding Feynman diagram is shown in Figure 36. Amongst similar 
reactions is, for example, the reaction 


n+N>N}2>7+N, (33.2) 


m x 





N N 


Fig. 36. Diagram of baryon resonance. The wavy line denotes a pion (z), the straight line 
a baryon (N). The heavy line denotes the excited nucleon (N*). Both vertices 4 and B 
are “strong”. 
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where N is a nucleon and Nf, is the resonant state which has the quantum 
numbers 7; =+ (T, is the third component of the isotopic spin) J=4, 
is the spin, P is the parity +, and strangeness s=0. The energy of this 
resonance is around 1400 MeV. 

At the present time a large number of baryon resonances are known 
[67]. The width of these resonances F is equal to an order of magnitude to 
~ 100 MeV so that the lifetime of these particles is equal to 


h 
Tt = =s x 10774 sec, 


i.e., it is of the same order of magnitude as the time for strong interactions 
of the original particles t,. The lifetime of mesons which decay in strong 
interactions, e.g., 

pontn, p’>ont+n, K*>K+r. (33.3) 


is of the same order of magnitude. The corresponding diagrams are 
shown in Figure 37. 


x 
P 
x 
n 
p? 
x 
K 
k* 
x 


Fig. 37. “Strong” vertices for the disintegration of p-, p°- and K-mesons. 
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Therefore, when we select T> t,, we must consider baryon resonances 
and mesons, which decay during strong interactions as unstable particles. 
They do not enter into the asymptotic space R(p). 

Note that the lifetime of baryon resonances and mesons that is 
defined above refers to a system at rest. If the disintegrating particle is 
moving, then its lifetime t’ increases by a factor of E/m, where E is the 
energy of the unstable particle and m is its rest mass: 

, 2 
T= aoe (33.4) 
It is for this reason that ultrarelativistic particles may be observed at 
very great distances R from the place where they are produced, in a way 
indicated by 


E 
Reet =ct—. (33.5) 
m 


Therefore, an unstable particle with Em may change to the class of 
conditionally stable particles (if Et,/m>T>t,) and its existence must 
then be reflected in the space R(p). 

Amongst the unstable particles that are formed in strong interactions 
are x°-mesons. These mesons decay into two y-quanta so that electro- 
magnetic interactions take part in the decay mechanism 


nmoyty (33.6) 


The diagram for this decay is shown in Figure 38. In addition to the strong 
vertex A there are two electromagnetic vertices B and B’. These vertices 
are proportional to the electromagnetic interaction constant e/(hc)!/? 
=«a'/? and, therefore, the factor «? =(e?/hc)~10~* appears in the ex- 
pression for the probability of such a decay process. Because of this con- 
stant, the decay time of x°y+y’ is far greater than t,. Estimating the 
lifetime of a z°-meson from this diagram taking into account the phase 
space volume of the vertices B and B’ gives a value of t,,~ 10716 sec. 
For a value of 7 in the range 


Ty > T > t,, (33.6’) 
a x°-meson may be considered stable. For T> T,, in the asymptotic space 


R(p) a photon must be included because at some distance from the point 
of interaction a y-quantum will be observed instead of the x°-meson. 
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Fig. 38. Diagram for the decay process n°—y+y'. The vertex A denotes a strong inter- 

action. The 2°-meson dissociates here into a pair of nucleons (N) and an antinucleon 

(Ñ). At the point B the nucleon emits a photon y. At the point B’ the nucleon N is anihilated 
by the antinucleon Ñ and a photon y’ is emitted. 


A significant number of mesons and excited baryons whose decay is 
accompanied by a charge in strangeness (4s #0) have a still longer life- 
time because weak interactions take part in their decay mechanisms. The 
decay of x*-mesons is decribed in Figure 39. This diagram consists of 
one “strong” vertex A and vertices of “weak” interaction W. Because the 





Fig. 39. Diagram for the decay of a z*-meson into a y-meson and a neutrino v. A is the 
“strong” vertex. The vertex W denotes a weak interaction. 


constant of the weak interaction is small, the lifetime of the 7*+-mesons 
which decay according to 


nt out +y (33.7) 
(here u is a mu-meson, v a neutrino), has a value of t, + ~ 1078 s. 


Consequently, if we consider z*-mesons as stable particles, we must 
take a value of T in the range 1078 sec> T> 107 * sec. 
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The decay of baryons (hyperons) which is accompanied by a change in 
strangeness as, for example, the process 


Z >n+e +i (33.8) 


(~~ denotes a hyperon, n a neutron, e~ an electron, ¥ an antineutrino) is 
described by the diagram shown in Figure 40. Here there are two “strong” 





Fig. 40. Diagram describing the decay of a X~ hyperon into a neutron n, electron e~ 
and an antineutrino Ÿ. A and B are the vertices of strong interactions, and W is the vertex 
of a weak interaction. 


vertices A and B and one “‘weak” vertex W which describes the process 
A° spt +e +. (33.9) 


The interaction characteristic of thid process is considered to be an 
“elementary”, weak interaction (see [44]). The lifetimes of particles 
which decay in such weak processes are ts~ 107 '® sec. 

The foregoing shows that serious limitations must be taken into 
account when taking the limits tr; > +œ and t, > —oo in the S-matrix. 
These limitations arise from the analysis of the physical processes which 
are possible and the character of asymptotic space and the asymptotic 
states ¥; and Y, depend on the choice of time T. 


34. UNSTABLE PARTICLES (RESONANCES) 


We now return to the case in which there is an unstable particle with 
mass 3m>M> 2m in the mass spectrum. 

As we showed above, if the asymptotic time T is chosen such that 
T <t, where t=h/G is the lifetime of an unstable particle (G is the re- 
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sonance width) then the M particle may be considered stable and it may 
be included in the asymptotic space R(p). Therefore for 7’<t, the situa- 
tion reduces to that discussed earlier in Section 31 where we assumed that 
two types of stable particles exited. 

We shall now consider the opposite case, for T> 1. In this case only one 
type of particle, that with mass m<M will enter into the asymptotic 
space (p). Particles with mass M will appear in the form of a resonance 
state which, for T>t>t, has essentially completely disintegrated into 
the original stable m particles. 

We may assign a certain mass M (the resonance width G < M) to the 
resonance state and therefore also a total momentum P =(P, Po) where 
Po=+(p?+ M7’)!/?, We may also assign to this state an angular momen- 
tum operator K, which consists of the spin of the m particles Sis (if they 
have any) and of the orbital momentum of their relative motions Ên 
=p,X,—P\X, (u, v=1, 2, 3, 4). The internal moment of the resonance 
state K,, is defined as 





K, = 3 Enapa ? (34.1) 


where é,,.g, is the completely antisymmetric unitary tensor and My is the 
total angular momentum 


Mag = Lag + Sap - (34.2) 


In the center of mass system the pseudovector K, reduces to the 
three-dimensional angular momentum operator M(M,3, M31, M, >) [68]. 
In this coordinate system then, the operator K. » has a special meaning that 
is characteristic of non-relativistic theory, namely, 


K? =hJ(J +1), K= hm,, (34.3) 
where 
—~J<m,<+J, J=0,1,2,... or J=4,3.... (44) 


Thus the resonant state which for M>G represents an unstable 
particle may be assigned not only a mass M and total momentum P but 
also a characteristic angular momentum K. In their resonant state 
mesons are usually called resonons. The term resonance is reserved for 
states with baryons and mesons. 
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Resonons as well as resonances have other properties as well as 
those already mentioned (isotopic spin, internal symmetry, strang- 
eness). However, we shall not discuss these questions which relate to the 
classification and spectroscopy of elementary particles, but shall return 
to considering the simple case of possible resonances in a system of two 
scalar particles with mass m. We shall use the center of mass system to 
study resonance, so that P=p, +p, =0 (p1, p2 are the momenta of the 
particles). We shall concentrate on the relative motions of these particles 
which have a specific angular momentum J. 

The asymptotic wave function which describes the relative motion of 
the two m particles may be written as a sum of converging and diverging 
spherical waves, namely, 


Wr, t) = [a (s) èP + b(s) e~] e iF, (34.5) 


where p is the angular momentum, E the energy of the particle in the 
center-of-mass system, r the particle separation and ¢ their mutual time 
(see Appendix X). 

In the following it will be convenient to consider the momentum p 
and energy £ as functions of an invariant energy 


s=P2—P?, (34.6) 
In the center of mass we have 

s = P? = E? = (E, + E,)? = 4m? + 4p’, (34.7) 
from which 

pP =łs- m, (34.8) 
so that 

ip(s) =- |m — 5 (34.9) 

—iE=-,/-s (34.10) 
and 


1 eo E 
wir, t) = - [a(s) eT vm —s/4r + b(s) etvm—s/ar evs, 
r 


(34.11) 
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We shall now consider s as a complex variable and in order to remove 
the ambiguity in the radicals in (34.11) we make a branch cut on the real 
axis of the complex plane (s) from s= — œ to s=0 and from s=4m? to 
S+ œ (see Figure 41). 


A (s) 


aan 


4m? 








Fig. 41. The complex plane for the variable s. Two branch cuts are shown, from s = — co 
to s=0 and from s=4m? to s= +œ. 


Using this branch cut we extend the function of the complex variable 
s onto the second sheet of the Riemann surface. We define the first sheet 
of this surface (the “physical” one) such that 


s x 
Jr = = gil? eie- 0/2, 





(34.12) 
where 
R 2 5 1/2 arad 
e |m =g s sinz > 0, 
e e2>0 and m-— : =ee®, (34.13) 
On the second (“non-physical”) sheet we have 
m2 = ed gil2 eiletm/2 
4 ’ 
(34.14) 





= — e"? sinto. 


ay 
L 

3, 

| 
Alu 
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On the basis of (34.9) and (34.12) the momentum p(s) is defined as 


(a) on the first sheet 





ip(s) = — J 2 ; = je"? e”, (34.15) 
(8) =+ et e2, Imp(s)>0; (34.15) 
p 


(b) and on the second, nonphysical sheet 





ip(s) =- Jr = i = — iet? e02, (34.16) 

p(s) = — e"? e2, Imp(s)<0. (34.16’) 

In this manner the physical sheet of the surface (s) maps onto the upper 

half-plane of the plane (p) and the non-physical sheet maps onto the 
lower half-plane. 

Note also that in the stationary state the component of particle flow 


J that is directed along the radius vector has to be equal to zero. From 
this it follows that 


b(s) = a* (s*). (34.17) 


The condition that defines the discrete levels of our system, stable 
or resonant, states that 


b(s) =0, (34.18) 
and for stable levels 


~./—s=-iE and 0<E<2m. (34.19) 


The resonant levels lie in the continuous region of the spectrum. For 
these, we have 


By es as (34.20) 


where E > 2m, G>0. The choice of sign in (34.20) guarantees an exponen- 
tial decay of the resonant level for t>00. From (34.20) it follows that 


s = E? — 2iEr (34.21) 


(for G < E) so that Re s>0, Ims <0. 
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For the case of stable levels, from (34.11), (34.18) and (34.19) it follows 
that 


1 
Win t) = = a(s) exp(— e!/?r — iEt). (34.22) 
Note that on the second Riemann sheet we would have 
1 
Wi (r,t) = — a(s) exp(+ e/?r — iEt), (34.22’) 
r 


so that in place of (34.18) we would require that a(s)=0. Using (34.17), 
this condition would give the same level. 

We now return to the unstable level. If we remained on the first, 
physical sheet, then in order to get from the upper half-plane to the lower 
one to the point s=E?—2iEG (G<0), which lies below the branch cut, 
we would have to go around the point s=4m?, along the path indicated 
by the arrow in Figure 41. The argument of the function ip (s) [see (34.9)] 
would gain addition in phase of +2z— i.e., in this case 


i i2 : 
ip(s) = + e"? al; (p —n)+ | =- i"? exo(2), 


(34.23) 


where is defined by the expression 


T? 
ọ = — arc tan +2ran+ 0 (i) (34.24) 


2r 
E(1 — 4m?/E’) 
in which n=0 for the first sheet and n= 1 for the second. Therefore, on 
the physical sheet we have 
p(s) =- p ix, p =e"? cos $, = — g! sin 
(34.25) 
so that p’>0 and x’>0. Consequently, on the first sheet the wave func- 
tion (34.22) would have the form 


y(r, t) = ~ a(s) exp[— i(p’r + iEt)] exp[(x’r — ['t)]. (34.26) 


It is easy to show that this solution contradicts the continuity equation 
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for the current J because it decreases with increasing r(x’=0) and 
t(G>0). 

The correct wave function ọ;(r, £) for unstable levels is found on the 
second, non-physical sheet. In this case, according to (34.16’) we have 


o+2n 





p(s) = — £t? exp (i ) =p" + ix", 


(34.27) 


” 


p” = £1? cos Ps 0, x” =e! sin fs 0, 
2 2 
so that the wave function takes on the form 
1 
y;(r, t) = — a(s) exp [i(p’r — Et)] exp (x"r — Tt), (34.28) 
r 


which behaves correctly on the surface (r, t). 

We shall now investigate the space-time properties of the general 
asymptotic solution (34.5). In order to do this, we shall first change the 
normalization of (34.5) in such a way that the scattered wave u(p, r) 
assumes the form 

S; (p) exp [i (pr — Et)] 


u (p, r) = a ‘ (34.29) 





where S;(p) is an element of the scattering matrix S;(x) in a state with 
total angular moment J for elastic scattering with an energy E and 
corresponding momentum p. The normalization (34.29) corresponds to 
the initial asymptotic wave in the form 


exp [i (pr — Et)] _ exp [—i(pr + Et)] 


= 34.30 
Wor, t) Dipr ine (34.30) 
Comparing Equations (34.29) and (34.30) with (34.5) shows that 
a(s) 
S =—_ = (34.31) 
(p) BS) 
E — (Er + il) 
= + 34.32 
(Oe Gay (34.32) 


where we combined the factors that are regular close to the pole 
E=E,-—iG into the function g(s). 
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We now assume that the initial wave yọ is specified in the form of a 


wave packet 
+0 


Wr, t) = | c(p) Wo(r, t) dp. (34.33) 
Choosing w(r, £) to have the form of a wave packet allows the moment 
of particle impact to be localized about the time t=0. By considering that 
the function c(p) is concentrated about the point p= p,, we may rewrite 
(34.33) in the form 


y(r, th = am P [i (prr — Ert)] f (r — vrt) 





Š = exp [i (per — Ert)] f(r + vrt), (34.34) 


where ve=(0E/Op)p is the relative velocity of the particles near the 
resonant energy Ex Ep. The function f decreases outside the interval 


Ir + ot| > h/A, (34.35) 
where A is the width of the packet in momentum space 

c(p) zo (34.35’) 
for |[p—pp|>A. 


From this it follows that the collision takes place in the neighborhood 
of the time 
t=h/Avp 
(we assume that the width of the packet 4 is greater than the radius of 
interaction of the particle). 
Furthermore, we consider that 


goad: (34.36) 


where A is ae i The scattered wave which arises in the collision 
of the packets (34.34) has, in agreement with (34.29) and (34.33), the form 


— (Er + iP) 
— (Er — if) 
(34.37) 


u,(p, r) = ——— F c(p)exp[i(pr — ED] (9) 
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We set p— pr = č, so then 


TE a Fal | C (Pr + 6) exp Lig (r — vt] 
1pprr 
č — i (l /vr) 


From this we see that if g(s) and c(p) are such that the integration con- 
tour in (34.38) may be closed with a large semicircle,* then for (r — vgt) <0 
this semicircle will be drawn in the lower half-plane, and for (r —vgr)>0 
in the upper half-plane. Evaluating the integral gives 


ic exp [i(pyr — Ext)] 





u(p, r) =— 
Pr” 
1 
x exp [e — Ugt) A for r—vgęt<0 (34.39) 
and 
u(pr)=0 for r—vgęt>0. (34.39) 


In this manner causality is realized in resonant decay. Decay processes 
do not, in general take place in the region r> vgt which does not attain 
resonances that are formed for r~0, t=0 (to an accuracy of h/A). In the 
region r<vpgt resonances decay exponentially with a decay constant 
G=vp/A (see Figure 42). 


u(r) 





—> 
O PaUpt r 
Fig. 42. The decay of resonances is shown near the point r=0. In the region0<r<vef, the 
resonances decay exponentially In the region r>vet, their wave function u(r) is equal 
to zero. 





t If c(p) decreases sufficiently rapidly with increasing radius of the circle |p — pr|—> x. then 
the behavior of the other functions under the integral sign in (34.38) becomes unimportant. 
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This analysis of unstable states shows that the S-matrix can correctly 
describe the space-time behavior of unstable particles (“‘resonons”’ and 
“resonances”). The computations for non-relativistic resonons which 
further illustrate this theory are given in Appendix XI. 


35. CONDITIONS OF MACROSCOPIC CAUSALITY 
FOR THE S-MATRIX 


In 1942, Heisenberg put forward his prominent program of further 
development of quantum theory in which he introduced the concept 
of describing elementary particle events using the S-matrix instead of 
wave functions. The meaning of this was not refuted in the further 
development of experimental and theoretical physics. De facto methods 
which were based on the wave function concept were replaced by 
methods based on the analytical properties of the S-matrix. How- 
ever, a problem remains in that present-day mathematics of the $-matrix 
do not extend to regions in the small, to elementary events. 

Frequently, theoretical schemes which work only in S-matrices are 
similar to factories with only two departments, on which receives mate- 
rials and the other which packages the finished products. The depart- 
ment which processes the materials is lacking. The analytic continuation 
of the -matrix across a mass surface and energy surface allows us to some 
extent to view this “processing department” in which elementary events 
are “produced”. Nevertheless, the methods of analytic continuation are, 
to all appearances, not able to give a complete picture of physical phe- 
nomena in the world of elementary particles. 

We must admit that our present-day abilities are very limited. How- 
ever, there are two facts which support the use of the $-matrix. These 
are: (a) there do not appear to exist elementary particles which could not 
be described in terms of S-matrices, and (b) the S-matrix at least in prin- 
ciple is measurable. Thus we have a basis for considering the -matrix 
as a theoretical construction which may be able to preserve its significance 
in future theory. 

Thus we shall proceed in studying the S-matrix in the hope that it will 
remain able to account for observational results in the future. We must 
now formulate the causality principle directly in terms of the S-matrix 
without using field theories or microcausality conditions. 
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At first glance it may appear that there exists an irremovable contradic- 
tion in the formulation of the problem. The S-matrix is given in the 
momentum-energy space, in the multidimensional Lobachevski space 
R, (p). However, in defining the causal relationships, by their very 
meaning, it is necessary to use the space-time variables defined in the 
space R, (x). Because of this fact, causality can be formulated in terms of 
the S-matrix only to the degree of definiteness that is limited by the pos- 
sibility of using both spaces R, (p) and R4 (x). This possibility is character- 
istic of classical macroscopic physics. Therefore we shall call the 
causality which we shall study in this section macroscopic causality 
[69, 70]. 

Simultaneous use of both spaces R3(p) and R,(x) rests upon the 
principle possibility of constructing a wave packet which in the time 
t;=—Ttot,=+T (T is the asymptotic time) diffuses so slightly that 
the whole process of collision of two wave packets in a time 2 T may be 
represented by particles that are localized in space and time. The possible 
existence of such packets was shown in Section 18 (see [30, 74]). 

The conditions for describing collision processes using wave packets 
was also discussed in Section 18. These conditions state that 


a? 
T >R>a>1, (35.1) 


where Å is the wavelength characteristic of the packet, a is the packet 
dimension, R is the distance between the packets R= Tv and v is the 
relative velocity of the packets. Figure 43 shows two wave packets u; (x,) 
and u(x.) which describe particles in the initial state. In satisfying 
(35.1) these packets do not change much in the time t; = — T to t, = + T. 
We now turn to the mathematical description of the collision of these 
packets using S-matrices. We shall write the equation for scalar particles 
in an explicit form. According to (31.28) the S-matrix element, written in 

the momentum representation has the form 
Sp = <p E ip) + it (P — P) PP) (35.2) 


x/2Do --- 2Po 


The functions < p’||p> and T(p', p) that describe the transition from the 
state (P)=(Pw Pn-As-+%s Pı) to the state (P)=(Pm+m Pm+n-—19+++9 Pat 1) 
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may be written in a more detailed manner in the form 


<p'| Î ip) ome I] ô? (Pr+s = P,) for m= n, (35.3) 


s=1 
otherwise <p’| I |p) =0 and 
T (p,p) __ T Pmtns Pmtn—15 +++ P2 Pi) 


V 2Po -+ 2Po ree Vaia oo S 


where the numerator is an invariant function of the momenta and p9, 
P$» --., Pes, are their fourth components. 





(35.3’) 





t=-T 





Fig. 43. Description of a collision using wave packets. Packets u, and u, are shown for 

tı = — Tin the apertures A and B. At the time ¢~0, they collide (in the blackened region S). 

Thereafter the packets separate and gradually diffuse. This increase in packet size is 
shown for the time ż,= + T. The scattered waves are not shown. 


For the sake of definiteness, we consider that there are only two par- 
ticles in the initial state ¥, and that these two particles are described by 
the wave packets u, (x,) and u,(x,) which belong to the class of packets 
which satisfy (35.1). Such packets may be represented by the integrals 


if o P 
uO) = gn | ü (P) exp (ipx) = (35.4) 
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where p° =(m? + p”)'/?. The wave function of the initial state ¥,, may be 
written in the momentum representation in the form 


PaPa pi) = 22 Od) (35.5) 


V 2p$ V 2p! 


Then the wave function for the final state F (P4, Pa) for m=0 (elas- 
tic collision) is obtained from (35.5) using the S-matrix transformation 
(35.2): 


Pou (P4 P3) = Žin (Pa, p3) + i| 0. + P3 — P2 — P1) 


x TOPe Psi Pao Pa) e jū Si P2 d*p, 
et P2) 41 (Pi “2p2p? a ee 
V 2p32p5 
(35.6) 


and the wave function for m>4 (formation of new particles during col- 
lision) has the form 


Pout (Pms Pm—19 +++ P3) =a ifon + Pm-1 +e + P3 — 


T (Pm Pm-19+++> P35 Pas Pi) 








— P2 — P1) 
V 20 2D 4 25 
x A d*p, dp, 
x ü (P2) ñ; P) 50 2p? (35.7) 
From (35.4) it follows that 
ŭ (p) 1 ‘ 
TA = Qn fu (x) exp[— ipx] d°x. (35.8) 


This relationship allows us to change over to the coordinate representa- 
tion in (35.6) and (35.7). In order to do this we express the amplitudes 
itz (P2) and ù, (p,) in (35.6) and (35.7) in terms of u, (x,) and u, (x2) using 
(35.8) and then we multiply the left-hand side of (35.6) and (35.7) by the 
exponential function 


1 exp [i (PmXm + Pm-1Xm- ee 
3/2 (m— 2) 
(an) xf Ape 2p? -1 





(35.9) 
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and integrate the product with respect to d3p,,, d*p~m_—1,-.-, d3p3. The 
result is 


Yoi (x4, X3) = Pin (x4, x3) + i | G (x4, X3 | X25 xı) 


X Uy (X2) u; (x1) d3x, d3x,, (35.10) 


and in the general case for m>4, 


Wo (Xs Xa sis age) St foan Xm—19 -9 X3 | X2, X1) 
X Uz (x2) Uy (x1) d?x2 d?x,, (35.11) 


where the function O (Xm, Xm—15-++» ¥3|%2, X1) is defined as 


G (Xm Xma-19 ++ X3 l X2, xı) 
_ PG (Xm Xm—19 ° X3 | x2, X1) (35 12) 
Ot, Oty i i 
where t, =x9, t; =x? are the fourth components of the vectors x, and x, 
and the function Go (Xm Xm-—1>++- X3|X2, X1) is equal to 





Oo (Xm Xm-1s n X3 | X25 xı) 


= | e+ Pn- ttp PPD 


x T (Pms Pm—19++*s P33 P2, P1) 
X EXP [i (PmXm + Pm-1Xm-1 + + P3X3 — P2X2 — P1X1)] 
d? m we ... d? 
Pa T Pa Sie (35.13) 


0 


2p.2Pm—1 = 2p? 





From (35.13) it is obvious that the function ©, (3.1), (3.2) is in fact one 
of the matrix elements T (3.1), (3.2) [see alse (35.2)], written in the 
coordinate representation. The functions ©, and Gy satisfy Klein’s equa- 
tion by any of their arguments 


(C2 - m) 6, =0, (G2 —m)6, =0; (35.14) 


s=1, 2,...,m. These functions describe the collision process in terms of 
free particles. They are analogous to the functions (Xm, X%m—1)+--» X1) 
that we considered in Section 24 [see (24.5) and (24.6)] but they are not 
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identical because the spectral decomposition of the functions s (Xm, Xm—1> 
..., X1) is not contained within the Lobachevski space R, (p). 

The functions $ (Xm, Xm-1, X2, X1) which are based on the concept of 
scattering matrix § in terms of local fields (see Appendix VII) may not 
exist whereas the function Ọ (Xm, Xm—15-+» |X2) X1) is based on the more 
general assumption of the existence of the scattering matrix S taken for 
real particles and on the energy surface. 

We shall now formulate the principle of macroscopic causality. The 
wave packets u(x,) and u(x.) which describe the initial state ®;„ (x3, x1) 
contribute negligibly to the scattered wave ®,,,(Xms Xm-1,--- X3) ifin the 
asymptotic region 


kxd R, litl T, (35.15) 
the points x,, X2,..., Xm lie outside the region defined by the conditions 

(x2 — x4)? = (t2 — t? — &, — x1)? > 0 (35.16) 
and 

(x,-x,?>0, ¢t—t,>T, (35.17) 

(x;-x,)?>0, ¢t,-t,>T. (35.17') 


(i=3,...,m—1, m). These conditions are illustrated in Figure 44 for 
elastic collisions (m= 4). 

We shall denote the region in which the conditions (35.15)-(35.17) 
hold by M(x). Then in terms of the function G (x)= (Xm, Xm—1 ++ -|X25 
xı), the principle of macrocausality may be stated thus: the functions G(x) 
must vanish sufficiently rapidly outside the region M(x) 


G(x) +0 inside M(x). (35.18) 


In this manner the macroscopic causality condition restricts the be- 
havior of the function G(x) only in the asymptotic regions. The words 
“must vanish sufficiently rapidly” must be understood to mean “‘signifi- 
cantly faster than the usual causal interactions vanish.” 

Note that these interactions vanish in a wide zone as 1/R. Therefore, 
non-causal interactions must vanish significantly faster than 1/R. If they 
vanish in a region that is closer to the wave zone of the signal, i.e., for 
R< Å where Z is the wavelength, then they will in general be unobservable. 
Therefore the macrocausal condition given above allows for wide mod- 
ifications in microcausality and, therefore, for changes in geometry in the 
small. We may say that the -matrix is characterized by its tolerance, its 
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t 


Ug (X4) 


U3(x5) 








Fig. 44. Macroscopic causality. ABC is the light cone at the wave packet u; (x1); A'B'C' is 

the light cone at the wave packet u, (x2). u3(x3) and u4(x4) are wave packets that describe 

the scattered waves. The relativistic separation of the four packets shown corresponds to 
the fulfilling of macroscopic causality. 


readiness to assimilate various theoretical schemes for describing elemen- 
tary particle interactions and their inner structure insofar as such changes 
take place in small spatial regions with dimensions ~ b and in small times 
= b/v. We shall call the theoretical schemes in which microcausality is 
observed acausal (or non-local, compare with Section 26). If acausality 
does not violate macrocausality, we shall call such schemes locally 
acausal 

Therefore, to construct such a scheme, it is sufficient that: (a) The 
-matrix remain locally acausal. Furthermore, we must recall the neces- 
sary condition that the $-matrix be unitary (31.3): (b) The S-matrix must 
be unitary.t Combining these two conditions in non-trivial and in fact 
is one of the basic problems of non-local theory. In the next section we 
shall consider examples of propagation functions that satisfy the con- 
dition of local-acausality and in Section 37 we shall give an example 
of constructing an S-matrix which satisfies conditions (a) and (b). 


* We may consider to what extent the condition of unitarity is really necessary, however 
omitting this requirement leads to very serious problems (see Appendix XII). 
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36. EXAMPLES OF ACAUSAL FUNCTIONS 


We shall first consider the acausal analogue of the causal function D, (x) 
which is symmetric with respect to the past and the future. D,(x) #0 for 
é= +1. Because of this it is very convenient for constructing an acausal 
influence function that is symmetric with respect to retardation and 
advancement. 

For simplicity, we shall consider the case in which the rest mass of the 
field particles that transmit the interaction is equal to zero. In this case, 
according to Appendix I [Equation (I.6)] we have 


i l 
DQ) =6(2)4+- 5, =P-P, 
TX 


1 
t=t,—-t), r= |x,—-x,|. eon 


By expanding 6(x*) and 1/x* we may rewrite this equation in the form 
1 
Dx) => [8* E- r) +o" (+ r)]. (36.1) 


We may now construct the acausal function D,(x, a) which approximates 
the causal function D, (x) for sufficiently small values of the parameter a. 
In order to do this we first change 


(awe Ieee (36.2) 
Pe nxt oh gt 

and 

= 3 

= 4g: OC ee. (36.2) 

TX a 
Then D,(x, a) may be written as 

A 
D, (x, a) = 6(x?, a) + i= 6(x’, a), 
$ (36.3) 





1 1 = 1 | 1 1 | 
nx ia? arj f Fia ltt t-t , 
tT =4+/rt+ia?, t=- r + ia’. (36.4) 


where 
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We now compute the field @(x,) which is produced at the point 
X(X2, t2) by the source Q(x) which is localized at the point x, (x1, ¢,) 
to within the space-time region PUDSET Let this source be defined 
by the equation 





Q(x) =p =) g (2) ert, (36.5) 
where w>0, wt>1. L~cT. In particular, 
Xi 1 L ty 1 T 
o(B)-t ate (4) = nie + T?’ Ge) 


as in Section 26. 
The field @(x,) which is produced by the source Q(x,) (36.5) may be 
computed, using the acausal influence function D,(x, a) (36.3), from the 


equation 
X; ti : 
9 (x2) = fe (=) d?x, f D, (x, a) g (z) e dt. (36.7) 


Placing D.(x, a) from (36.3) into this equation gives 





i 1 | (=) d5x, 
x = 
vee 2ni á L /r2 + ia? 


«| (el E Jar (36.8) 
g T: tti t— T2 Me : 


where t=t,—1,. In the complex plane t,=¢,+it{ there are poles at 
t,=,—1,, t=t—t, and t=+iT. The integral in (36.8) for w>0 may 
be closed in the upper half-plane with an infinitely large semicircle. The 
pole t, =iT gives a contribution which becomes vanishingly small for 
wT æ ase~® so that only the contribution from the pole t, =t3— t1, 
Im t; <0 remains. 

We thus obtain 


ae | (2) 
_ 2 72 
x e vrai") (36.9) 


T 





exp[iw(t, — ./r? —ia*)] 
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The expression 

ne E exp [iœ (t, — ./r* — ia?)] 
may be considered as the retarded Green’s function for an almost periodic 
source (with frequency œ). This function has no singularity at r=0 and 
for r>a it becomes the Green’s function 6,,,(r, t2) which was computed 
in Section 26 using D,(x). The proper character of the causality relation 
for r>a is guaranteed for this case by two facts. These are (a) the proper 
choice of singularities of the function D,(x, a) and (b) the choice of the 
source. The source Q(x,) is localized in the space-time region Q x L T> 
> 73/w and has a positive frequency w>0 which is responsible for emit- 
ting a quantum with a positive energy hw>0. In the case when w<0, 


hw <0 instead of (36.10) we would obtain the expression for G(r, t3, a) 
which approximates the advanced potential: 





(36.10) 


Siol 74,2 
Che =e ee) (36.10’) 


JP? + ia? 


This equation is obtained by closing the integral (36.8) in the lower half- 
plane where there are poles t£, = — Tand t, =t, — T3, Im t, >0. The inter- 
pretation of the functions 6,,,(r, t2, a) and ©,,,(r, t2, a) is the same as 
that given in Section 26. 

Note that if we had taken very small values of L and T, we would have 
come to a significant violation of macroscopic causality. To explain this, 
it is sufficient to consider the case of small T (wT ~ 1). In this case we 
cannot neglect the contribution to the integral (36.8) of the pole t, = +iT. 
Including this pole gives 


e or Xi) 3 1 
20%) oF fe OMG r Tra 


eer X; d3x, 
z% ; 36.9” 
in pE): -r Gey) 


The nature of the denominator in (36.9’) indicates that g(x.) contains 
a symmetric advanced and retarded influence. This violates macroscopic 
causality and it shows the extent to which it is important for the source 
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Q(x,) to be smooth in space-time for formulating causality. It is neces- 
sary that the quantity œw T be much greater than one. This condition agrees 
with the condition that the energy of the source be sufficiently well 
defined, both in sign and magnitude. 

As we shall show in Section 38, one cannot formulate the violation of 
microcausality in such a way as to ensure /ocalization of this violation 
without using the time-like vector n (n?=1). This vector has various 
meanings. In the case we considered it plays the role of the vector 
(œ, k) where œ is the characteristic frequency of the source and k 
is the characteristic momentum. In particular we saw that when the sign 
of œ is changed, the retarded potential is replaced by the advanced one. 

We now turn to an example of an acausal influence function in which 
the vector n is introduced into the influence function itself in an explicit 
manner. In this example, as in the ordinary causal theory, any influence 
function may be obtained by superimposing retarded and advanced 
Green’s functions. 

In particular 


D, (x) = 3 Dyer (x) + 2 Daay (x), (36.11) 


and according to (26.16) if we have a field source Q (y) then the field 
g(x) which arises in the point x is defined by the equation 


p(x) = | D,(x — y) Q(y)d*y. (36.12) 


In place of these causal influence functions D,(x), Dje (x), Dia (x), we 


shall construct acausal functions which are regular at the vertex of the 
light cone. For this purpose we shall use the form factor p(x, n) in 
which some time-like unit vector n appears as 


R 
p(x,n) =p (*), (36.13) 


where 
R? = 2(nx)? — x? 20 (36.14) 


is an invariant positive-definite quadratic form of the variable x.t We 
assume that the form factor (36.13) vanishes rapidly (e.g., exponentially) 


t In the frame of reference in which n=1, 0, 0,0, the invariant R?=1?+x?>0 
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for R/a— oo, so that the quantity a represents the radius of that region 
within which microcausality is violated. We assume that this form factor 
does have a singularity at the origin that is stronger than 1/R. 

The acausal influence function is defined by the equations 


Dyer (x, a) = [pi (x — &) p(é,n) dté, (36.15) 


Dav (x, a) = | Pas (x — 6) p (&, n) d*C. (36.15) 


According to these equations the changed ‘‘acausal” field ọ(x, a) is 
defined, instead of (36.12), by the equation 


ọ (x, a) = fe. (x — y, a) Q (y) dy, (36.16) 
where 
D,(x — y, a= D.(x — y — &) p (é, n) dté 


= 3 Dia (x — y, a) + 3 Daa (x — ya). (36.17) 


Introducing such a form factor is physically equivalent to assuming that 
each point in the field source Q (y) is diffused using the form factor (36.13) 
in a region R(x)xa* and, consequently, it produces the same field as 
that due to a charge that is distributed in a small region with dimensions 
of the order a*. Because of this localization of the form factor p(é, n) 
the behavior of the acausal function D,(x— y, a) for large |xo— tol and 
|x—y| i.e., for 


[Xo — tol > a, Ix—yl>a, (36.18) 
agrees with the asymptotic behavior of the acausal function D.(x—y). 


Thus it is possible to satisfy the conditions of macroscopic causality 
(35.15)-(35.17’). 


37. AN EXAMPLE OF CONSTRUCTING AN 
ACAUSAL SCATTERING MATRIX 


We shall proceed from the assumption that there exists a unitary S-matrix 
which is constructed on the basis of local theory which assumes that 
microcausal conditions are satisfied. We shall also assume that all diver- 
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gences are removed from the matrix by, say, renormalization. In this 
manner the original S-matrix will contain only matrix elements which 
describe real, physical processes. Consequently, the matrix elements will 
be defined on an energy surface (real particles) and will be written in the 
form (31.28). 

Our problem is to find the acausal scattering matrix S, that satisfies 
the macrocausality conditions and the unitary condition (compare with 
Section 35). We shall consider this matrix as “close” to some local 
matrix Ê (we shall explain the meaning of the word “close” later). We 
write the initial local scattering matrix § in the form 


§ = elt, (37.1) 
where # is the Hermitian phase operator such that 

ñ =ì* 
We consider that the interaction is not strong and that it is such that we 


can write the operators and ĝ in the form of a series in terms of the 
interaction constant g 


S=i+ ¥ g,, (37.2) 
n=1 
A= D d. (37.3) 
k=1 


Inserting these series in (37.1) we obtain 


x i RAI A 
DFD Oatbte-ten—k-1> (37.4) 
ab...¢ 


k=1 


where the products (Na, Ne» ---, Ne) contain k factors. From (37.4) in par- 
ticular, it follows that 


i 


§, =i, $, = iñ, Po For (37.4') 
In these equations the operator S is expressed in terms of the operator 
ñ. It is also important to solve the equations to obtain the operator 4 
in terms of the operator S. The structure of the matrix elements of the 
operator # corresponds to that of the scattering matrix elements [see 
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(31.28)] and so we may write 


lap) = 6 Sr — Tp, (37.5) 


V Po 2Po 


where n(p’, p) is an invariant function of the momenta p and p’. A replace- 
ment, of the matrix § by the matrix # is important; because if micro- 
causality is violated in some way, then the unitarity of the matrix S may 
also be violated and one may come to a contradiction. If however we 
use the operator 7, then changing from the microcausal operator 7 to 
the acausal operator 7, is sufficient to ensure that the new operator is 
Hermitian. A Hermitian operator satisfies the simple symmetry condi- 
tions which can be easily verified. The unitarity of the acausal scattering 
matrix S, is also ensured by these conditions. It is very difficult and often 
impossible to prove directly that the matrix S, is unitary. 

In order to prove that the matrix S, satisfies the microcausality condi- 
tions, we must use the space-time description which we must now for- 
mulate in terms of the operator 4. For this purpose we recall that, ac- 
cording to Section 35, the macroscopic causality conditions are for- 
mulated for the function © (x) [see (35.18)], which is defined by Equations 
(35.12) and (35.13). The matrix element which enters into the second equa- 
tion is directly related to the $-matrix by Equations (35.2) and (35.3). 

Using (35.2) and (35.4) we may rewrite the function T(p’, p) in the 
form of a series in terms of the interaction constant g and the quantities 
Nis Nos--+s Ne --- [See (37.3)]. Using the matrix multiplication law for a 
Lobachevski space [see (31.26)], it is easy to show that the expansion 
has the form 


i 
T (p', p) = gn: (P', pP) + 972 (P', p) + 3 gni(p’, p) + 


= gn, (P', pP) + g°n2 (P', p) 
i 3p” 
+59 | mi (P's PS "B+, (37.6) 
Po 
where p’ denotes the set of momenta Pm» Pm—1,---» P3 in the final state, 


and p denotes the set of momenta in the initial state p}, pı [see (35.2) 
and (35.3)].‘ 


t As in Section 35 we assume that there are only two particles in the initial state. It is easy 
to generalize this for a greater number of particles. 
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In this manner Equation (37.6) may also be written as 


T (Pre Pm-19 +++ P3» P2» pı) = gı (Pins Pm-1> <- Day Pı) 
+ 9°N2 (Pw Pm- 1 -> Po» P1) 





i d3p5 d3p't 
Eg Ro 2 Sa eer eee A n DEL 

59 Jno Pm—1> -> Pi Pi) i 
X- M1 (P3 Pis -> Pas Pi) ++. (37.7) 


Inserting this expression for the function T in (35.13), we write the func- 
tion ©, in terms of the matrix elements of the phase operator 7 


Go (Xm Xm-15 = X35 X25 xı) = [ton + Pm-1 Pit P3 — P2 — Pı) 
X {N1 Pm Pm- 1> -+--> P2 Pt) + 9712 (Pm Pm- 19 +++» Po Pa) 
i „m BPs dpi 
rie fm (Pm Pm-—1> «++» P2; Pi ” ” 
2 P20 2Pio 
xy (p> Pi» <.” Pz Pi) + e} 

X expi [PmXm + Pm-1Xm-1 +*+ P3X3 — P2X2 — PiX1] 
d?p” A? Pn —1 d3p! 
x oes ? 
2pm 2Pm-1 2p? 








(37.8) 











The function G (Xm, Xm- 1»--- X2, X1) which transforms the incident wave 
into the scattered wave is obtained from Og(Xms Xm-1,--- X2, X1) by 
differentiating with respect to x$ and x? [see (35.12)]. 

Thus the function Ọ (Xm, Xm-1,:-- X1) is a functional of the matrix 
elements 4,(p’, p), n2(p', p),... of the phase operator 7. Making use of 
the properties of these elements and using (37.8) we can verify that the 
conditions of macroscopic causality (35.15)-(35.17’) are fulfilled. 

We now turn to a simple example of an acausal matrix ĝ,. For definite- 
ness we assume that the original causal function S$ describes a process in 
the scalar field g(x), in which the interaction energy has the formt 


W =9:97(x):, (37.9) 


where @(x) is the scalar field operator, and g is the interaction constant. 
Using the usual Feynman diagram method, it is easy to show that the 


t Compare with Appendix VII. In this appendix we remark that the system with interaction 

go? is not stable. However, if we limit ourselves to perturbation theory, then this fact is 
pe y 

unimportant. 
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matrix elements of the operator # to the first order are identically equal to 
zero 


4, =0. (37.10) 
The second-order matrix elements n, which describe the scattering are 


equal to the matrix elements of the operator ŝ, [compare with (37.4)] 
and they are described by the sum of the following diagrams 


P2 Pa 
Pa Pa 
N2 Ss + 
Py P3 
Py P3 
(37.11) 


The analytic expression for this element is 


n2 (Pas P3» Pa» P1) = {D (p3 — p1) + D. (p2 + pi}, (87.12) 


where Ď,(q) is the transform of the causal function D,(x) and q is the 
momentum that is transferred along the inner line in the diagram 
(37.11). 

According to (37.8) and (37.12) for elastic scattering we have 


Bo (X4; X3, X2, X1) = [oe + P3 — P2 — Pi) {D.(p3 -= pı) 


+ Ď, (p2 + p1)} expi(paX4 + P3X3 — P2X2 — P1X1) 
d? d? 

Noell ones (37.13) 
2p4 2pi 


We assume that the initial state of the particle ¥;, (x2, x,) may be defined 
in terms of wave packets u,(x,) and u,(x,) of the type considered in 
Section 35 [compare with (35.4)] 


Pin (X2, X1) = Uz (X2) Uy (x1). (37.14) 


Using Equation (35.10) we obtain the scattered wave for the elastic col- 
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lision case 
P our (X4 X3) = Uz (x4) Uy (x3) + i f d*x d*yD* (x4 — y) 


x D* (x3 — x) D. (x — y) uz (y) u, (x) 


, x+y 
+if atx ayot (xa - ; ) 


x p(x; = E *) o(* 5 *) ty (x) u; (x), 
(37.15) 











where the first integral is defined by the first diagram in (37.11) and the 
second, by the second diagram. The computations leading to this result 
are given in Appendix XIII. The function D* (x) is defined in Appendix 
I, Equation (1.10) and D, is the causal function, This expression for the 
scattered wave is based on conventional local theory in which the 
microcausality conditions are satisfied. 

We now replace the function D.(x—y) which enters into (37.15) by 
the acausal function D.(x— y, a) 


D,(x — y, D= | Bux y- O06 n) d*¢. (37.16) 


where the form factor p(č, n) depends on some unit time-like vector n 
and the elementary length a. This form factor is assumed localized in 
the region R, (E) xat. The vector n will be defined later. As we showed 
in Section 36, replacing the causal function D,(x— y) by the acausal func- 
tion D.(x—y, a) assures that the macroscopic causality conditions 
(35.15)-(35.17') will be satisfied. 

From (37.16) we obtain the equation 


D,(x, a) = | expi pxD, (p) ô (p, n) d*p, (37.17) 


where D, (p) is the Fourier transform of the function D, (x) and p(p, n) is 
the Fourier transform of the function p (č, n) 


p (¢, n) = foo n) expipě d%¢. (37.18) 
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As an example of the form factor p(é, n) we may take (see [69, 72]) 


— R/a 





p(,n) = RE (37.19) 
where 

R? =2(E,n)? — 2, (37.20) 
where a is the elementary length. From (37.18)-(37.20) we have 

~ M? 

p(p, n) = (37.21) 


2 (pn)? — p? + M?’ 
where M = 1/a is the large mass corresponding to the small elementary 


length a. In the coordinate system in which the vector n=(1, 0, 0, 0) we 
have 


M2 


D , 1) = —— c. 
P (p. n) RIDIN 


(37.21') 
We now turn to Equation (37.16) for the acausal function D,(x, a). We 


assume that the mass of the particles is equal to zero. In this case (see 
[70) 


1 1 

D(x) =-6*(t-n+-d (t+n, (37.22) 
r r 

where 
A 

1 ; 

os | etm dv, A>+o. (37.23) 
2n 


0 


We first compute the part Dt (x, a) of the acausal function (37.1 7) which 
is derived from the term with 5* (t—r) in (37.22). In the integral 


Di (xa) = f Dz (x — &) plén) dee (37.24) 
we replace ¢ by t—t (i=) and r(x) by r(x—§)=r(x)+o&+ ..., where 


is the unit vector in the direction of the vector x. Then using (37.23) we 
obtain 
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A 


Dž (x, a) -1f o6 n) dé dr | xp bivt r — t — aģ)] dv 


(0 
A 


= : | exp [iv (t — r)] P(v, v,an) dv 


0 


A 
1 M? 1 
z dv=- At 
= | exp [iv(t — r)] ——— a E 
0 
x (t — r, A, M) (37.25) 


for A>oo. 
The function A* (t—r, 00, M) is computed in Appendix XIV and this 
leads to the results (XIV.10) and (XIV.11). For |¢—r|> a, 


ah: “) = 3° o( 2 ) 
a =d*(t-—rnt+ T 


n exp( lt- i) 
+7 E E i (37.26) 
2 = a,/2 


For |t—r|—0 this function is regular and has the limits 


+ (o. 0, *) eed (37.27) 
a 4 a/2 


The function 6* (¢—r) has the form [see (1.9)] 














ôt (t—r)=ô(t— r) + : (37.28) 


nt~r 

Thus the second term in (37.26) which decreases as a?/(t—r)*, and the 
third exponentially decreasing term may be considered as corrections to 
the causal function 6* (t—r). In this manner one can verify that the con- 
ditions of macrocausality are fulfilled. 

We must now verify that the condition of unitarity of the scattering 
matrix is satisfied. According to (37.1) it is sufficient to prove that the 
phase operator ù, is Hermitian. Using (37.12) and (37.17), instead of 
(37.12) we obtain for the acausal operator 4, 
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N2 (Pas P3; Po» P1) = D. (p3 — P1) À (P3 — Pi. n) 
+ D. (p2 + pı) Õ (P2 + py, n). (37.12) 


From this it follows that if the following equations are satisfied 


P(p3 — pisn) = 
P(P2 + pı, n) = 


P* (Pı — P3» n), (37.29) 
p* (D3 + Pa, n), (37.30) 
then the matrix element n(p4, P3, P2, P1) Will be Hermitian if the vector 
n does not change when taking the Hermitian conjugate. For this it is 
sufficient that this vector be symmetric with respect to interchange of the 
momenta of the initial and final (p4, p3) states. In particular, we may 
write 


festa. (37.31) 


JP 


where P =p, + p2= p; + p, is the total momentum of the colliding par- 
ticles. The second possibility is that 








n= , (37.32’) 


where TI =p, +p,. The frame of reference in which the vector H =0 is 
called the Breit system. 

When the vector n is chosen to be symmetric, the acausal matrix S, 
is both unitary and macroscopically causal. We perform the calculations 
to an accuracy of g?. However the calculations can be carried to higher 
orders in the expansion of the operators S and # in terms of g. 

Therefore, we can summarize the results of this section in the follow- 
ing way. If a microcausal unitary scattering matrix exists (at least to an 
accuracy of O(g%) where N is some positive integer), then an acausal 
unitary scattering matrix ĝ, which satisfies the principle of macroscopic 
causality also exists. 


38. THE DISPERSION RELATION FOR THE ACAUSAL S,-MATRIX 


Any violation of microcausality is immediately reflected in the analytic 
properties of the matrix elements of the S$-matrix. In particular the 
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dispersion relation for forward elastic scattering which we considered in 
Section 30 would no longer be valid. The nature of the various pos- 
sible changes in these relations depend on the actual form of the viola- 
tion of microcausality. We shall consider a form of violating micro- 
causality which can reasonably be called minimal. 

Let T(k) be the elastic scattering amplitude in the usual local theory 
and T,(k) be the acausal amplitude for the same process. We shall con- 
sider that the violation of microcausality is minimal if (a) the function 
T,(k) has the same spectrum as the function T (k), in other words if the 
function T (k) differs from zero in some region R, (k), then the spectrum 
of T,,(k) is also confined to the same region; (b) the symmetry conditions 
which are characteristic of the amplitudes 7,,,(k) and 7,4,(&) (30.20), 
(30.20’) remain in force and, for the corresponding acausal amplitudes 
T,(k), 

Ty ret (k) = Tyaay (— k), (38.1) 

Te eet (K) = Tarer(— k); (38.1') 

(c) the rate of growth of | T,(k)| for k œ does not exceed that of | T (k)|; 
(d) the conditions of macroscopic causality are fulfilled. 

A simple form of acausality (violation of microcausality) that satisfies 
these conditions is discussed by Blokhintsev and Kolerov [69, 72] and 


we shall refer to it in the following. Let 7 (x) and 7,,(x) be the amplitudes 
T (k) and T,(k) expressed in the coordinate representation 


T (x) = | T (k) e7" dk, (38.2) 


T,(x) = row ei 4k, (38.2’) 
Following the concepts discussed in Section 37, we write 
T, (x) = | T(x — č) p (č, n) d*č, (38.3) 


where the form factor p(é, n) depends on some time-like unit vector n. In 
reference [69] this vector is taken to be parallel to the vector H = p'+ p, 
where p is the momentum of the nucleon in the initial state and p’ is its 
momentum in the final state. In Breit’s system, when p’+p=0, the com- 
ponents of n are n=1, 0, 0, 0. Furthermore, as in Sections 36 and 37, we 


MACROSCOPIC CAUSALITY 221 


assume that the form factor p(é, n) is a function of the ratio R/a where a 
is the elementary length and R? is the positive-definite invariant 


R? = 2 (né? —-@>0,... (38.4) 


which vanishes sufficiently quickly for R/a— œ as 


R 
p(é,n) =p () 0. (38.5) 


From conditions (38.5) and (38.3) it follows that if the components of the 
vector x are the quantities |x;|>a (i=1, 2, 3,4), then the quantity 
T (x—€) may be taken out from under the integral by setting €=0 and 
the integral of p(é, n) may be normalized to 1 in the space R, (č). Thus 
for |x,;|>a@ we obtain 


T(x) x T(x), (38.6) 


which may be considered as fulfilling macrocausality. 
Changing to the momentum representation in (38.3) we obtain 


T,(k) = T (k) p(k, n), (38.7) 


where (k, n) is the Fourier transform of the form factor p(¢, n). If 
p(é, n) is a function of R/a, then it is easy to show that p (k, n) is a funct- 
tion of the invariant 


I =2(kn’ — k 50. (38.8) 


From (38.7) it follows that the spectral character condition (a) is auto- 
matically fulfilled. The symmetry condition (b) will also be fulfilled if 


p(k, n) = p(- k, n), (38.9) 
p* (k, n) = p(- k, n), (38.9’) 


i.e., if (k, n) is even and is a real function of the vector k. Finally, con- 
dition (c) will be satisfied if the function 6(k, n) is bounded for |k|> 00. 

It is sufficient that the conditions (a)-(d) are fulfilled in order to 
derive the dispersion relation for the acausal amplitude T, (x) in exactly 
the same manner as that which is used in deriving the dispersion relation 
in the usual causal theory (compare with Section 30). Thus in the case 
of minimal acausality we may write the dispersion relation for the forward 
elastic scattering amplitude 7,(w) using Equation (30.30) directly and 
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differing only in that the singularity which arises from the form factor 
p(k, n)=P(@) must be taken into account when the integral is taken 
around the infinitely large semicircle. Then in place of (30.30) we obtain 
the more general equation: 
2 2 
g + g 
O+0, w- 








T,(@) = 


g 





e 
g 5 ca” + Y (o), (38.10) 
where the additional term y (œw) has the form 
Y (w) = (w — wo)"t! > Res Ss (38.11) 


The sum here is taken over the singularities (poles) of the function p (œ) 
and A,(q) is the imaginary (absorbing) part of the amplitude T, (œ). 

Because the function p(q) is real and even, if it has a pole at the 
point w=a,, then it must have a pole at œ= —q@,. These poles cannot 
lie on the real axis because in that case they would describe real states. 
Therefore, they must have an imaginary part Im w,40 which is suffi- 
ciently large. We let |Im w,|~1/a where a is the elementary length. 

If the quantity a is small and if the energy w in the region we are con- 
sidering is small so that w<l1/a, then we may close the integration 
contour (see Figure 33) with a circle whose radius R is such that R< l/a, 
yet is also large enough so that it is possible to neglect the contribution to 
the integral from this circle. Obviously, under these conditions we ob- 
tain the usual causal dispersion relation. For R>1/a, additional acausal 
terms appear. 

If the elementary length a is sufficiently small, then the acausal matrix 
S, differs from the causal matrix S only in the region where the energy 
œw» l/a= M. Therefore for Moo the matrix S, is “close” to the original 
causal matrix . We call such a matrix almost local or almost causal. 

In concluding this section we shall write out the dispersion relation 
s for nN forward scattering, taking into account its acausal form. For 
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definiteness we set 


M? 1 
POV Te eat Ma (38.12) 
These equations differ from the causal ones by the additional term w(x) 
which results from the computations of the function (œ) at points 
w= +iM. 
Instead of Equations (30.32)-(30.32”) we now have 


g 4 __ (w* — @) 


— @ er 
M? * (a? — 5) (wo — o2) 


Aao a 
To — wå) la a J7 


D,(@) — Dao (@o) = 








— w) 
+ Wo (a), (38.13) 
7 2g? w, (2? — oå) 
Dy (@) — Das (@0) = M (w? — w2) (w — w2) 





af _ te aa = + yı (œ), (38.13') 
w wp) 


W, (w? — w3) 





w 
[Da2 (œ) — Daz (o)] Oo Mie — 4) (05 — 4) 


42 _ J Anz (') dev’ 
00-08) | aa at ab 


+ y: (w), (38.13”) 





where the amplitudes D,o(@), Daı (œ), Da2 (0), Aao (©), Ag: (@), 4a2 (0) 
have the same meaning as the amplitudes of the causal theory in equations 
(30.32)-(30.32”) and the additional terms are equal to 


Yo (@) = a P(@) do (iM), (38.14) 
w*— we. ; 
Yı (@) M4 o? P (œ) d, (iM), (38.14') 
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w — wr | ; if 
W2(@) = M+ ae? d, (iM), (38.14”) 


where dp, d}, d are constants whose magnitudes are 
fe D,o(@) _ Dalo) _ Dalo) 
ooa $ ’ 25 
P(w) P(w) (w) 
at the point œ =iM. From these equations it is obvious that the additional 
terms are important only for frequencies w> M. 











CHAPTER VII 


A GENERALIZATION OF CAUSAL 
RELATIONSHIPS AND GEOMETRY 


39. TWO POSSIBLE GENERALIZATIONS 


In present-day theory it is assumed that the geometry which we know 
from macroscopic physics retains its significance in microworld regions. 
Together with this geometry, the form of causal relationships, which is 
based on studies of macroscopic phenomena, is carried over into the 
microworld. Two basic ideas form the basis of modern concepts. These 
are: 


(a) the idea that the Einstein- Minkowski space-time is homogeneous 
and isotropic, and 

(b) the idea about transfer of interaction by physical fields (electro- 
magnetic fields, meson fields, neutron fields, etc.). 


The systematic use of these ideas in elementary-particle physics in 
regions of especially small time and space intervals leads to absurd results. 
The interaction energy of particles with very small separations, and their 
characteristic energies are very large. This unwanted result appears in 
both quantum and classical physics and it indicates that the origin of the 
difficulties may be the same for both concepts. 

We shall discuss this question in greater detail. In classical physics, 
the propagation of a weak (linear) signal from the world point P(x,) 
to the world point P (x3) is defined by the Green’s function Ọ (see Section 
25). We shall try to construct this function starting from the most general 
assumptions on geometry and causality. 

From the assumption on the homogeneity of space and time, it follows 
that the Green’s function may physically be only the dispersion of the 
coordinates of the points Q(x,) and P(x,), i.e., a function of the separa- 
tions x=x,—x, and t=t,—1t,. The requirement that space-time be 
isotropic leads to a further restriction. The function © must depend on 
the four-dimensional interval x? =1? — x? = (t, — t1)? —(x,—x,)? as well 
as on the separations x,—x,, ¢,—1t,. Lastly, we may still introduce a 
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direction for the time e=#/|t}= +1 for x?>0 and a direction along the 
spatial line y=x/|x|=+1 for x?<0. Thus we may write the Green’s 
function in the form 


6 = G(x’, e, n). (39.1) 


This function is invariant with respect to Lorentz transformation, but 
may not be conserved for discrete transformation T (inversion in time) 
and P (inversion in space). 

The relationship between the world points P(x,) and P (x2) may be 
expressed, using the Green’s function, in the formt 


ọ (x2) = [ow E, n) Q(x) d*x, > (39.2) 


where (xz) is the field at the point P(x.) and Q(x,) is the source of 
the field which is concentrated in the neighborhood of the point P (x,). 
We may analogously express the field @(x,) which arises at the point 
P(x,) from the source Q(x,) which is concentrated in the neighborhood 
of the point P (x). The relationship expressed in (39.2) does not in itself 
ensure that the relativistic principles of causality will be fulfilled which 
(a) limit the propagation velocity of all interactions by an upper limit of 
the velocity of light and (b) require that the cause [source Q located at the 
point P (x,)] produce a result [field @ at the point P (x,)] at a later moment 
in time, i.e., t£, >t;. In order to satisfy these requirements, we obtain the 
variation in the field @(x,) at the point P(x.) for variations in the source 
Q(x,) at an arbitrary point P(x,) or, as it is more convenient, we shall 
compute the functional derivative of the field @(x,) with respect to the 
source Q(x,) directly (see Appendix IX). From (39.2) we have 


ôo (x2) = 
òQ (xı) 


From this we see that requirements (a) and (b) will be satisfied if the 
following restrictions are placed on the Green’s function : 


G (x?, e, n). (39.3) 


6 = ©(x?, + 1), if x?>0,£=+1, (39.4) 
6=0 if x?>0,e=-1, (39.4') 
=o; ff x<Oe=+! G94" 


* Compare with Section 25. 
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The parameter y does enter into the function & here because it equals zero 
for x? <0. 

When Equations (39.4)-(39.4”) are satisfied, then so are the relativistic 
causality requirements, but this does not yet mean that the functions © 
can describe any physical field. In order that the function may be con- 
sidered as a physical field! produced by a momentum source, it must 
satisfy the linear, inhomogeneous differential equation 


LG (x?) = Q(x), (39.5) 
where the source has the form 
Q (x1) = ô$ (x1) = 69 (x1) ô (t1). (39.6) 


The linear character of the operator Ê follows from the assumption that 
the signal being considered is weak, and that the impulse character of 
the source Q corresponds to the usual definition of a Green’s function 
as an elementary signal which is produced by a disturbance in the neigh- 
borhood of only one point P (x,). 

From the assumption of relativistic invariance, the operator £ must 


be a function of the D’Alambertian 
8? 0? 8? 8? 








=— + 
Dest a2 ae of 
so that 7 
L=L(Q). (39.7) 
The function L may be either a polynomial or an infinite series 
N 
C n 
Lo =) 2; (39.8) 
2n! 
n=0 


Here N is either finite or infinite. In the latter case the operator L(0) 
may be non-local, i.e., by operating on some function f (x), it reproduces 
the function in some other point ọ(x’), x’#x.* The Fourier transforms 


6(x2, + l= | 6 (k?) e dfx, (39.9) 


t? Therefore ıt is possible to have a generalized interaction which does not reduce to the 
concept of a physical field. Such interactions are discussed by the author in [24, 74-77]. 

* This fact was pointed out in reference [74]. In reference [78] it is shown that this difference 
depends on the behaviour of the limit |C,,|!" for n— œ. The nonlocal operator from the class 
(39.8) was used to construct the version of the nonlocal field theory that is independent of 
dispersion [79]. 
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of the functions (39.4)-(39.4”) are functions of the invariant k? =w?—k?: 
where kx = wt —kx. Inserting (39.9) into (39.5) we obtain 


£(— k?) G(k?) = 1. (39.10) 


From this it follows that the zeros of the function L(— k?) are poles 
of the Green’s function 6(k?).t These poles cause singularities in the 
function G(x?) on the light cone x? =0. In Appendix I, where we derive 
the Fourier transformations (39.9) for the Fourier transforms 6(k?) 
which have first-order poles, we show that these singularities have the 
form (x7), In x?, 1/x?. 

We could have assumed that the situation at the vertex of the light 
cone changes in the sence that the usual geometric relationships loose 
their meaning because of changes in the nature of the phenomena them- 
selves which take place within small separations in space and time. In 
this case there should exist some “elementary length” a which would 
characterize the scale which restricts the use of the usual causal relation- 
ships for large |x| and |t|, in proportion to the approximation of |x|, |t| 
according to the magnitude of the scale a. The possible significance of 
such a scale for elementary particle physics was first discussed by Watagin 
[82] and Markov [83]. Since then, the idea about the existence of a scale 
which restricts the region of applicability of modern quantum theory and 
maybe the theory of relativity has been discussed many times and, to 
this day, theoretical thought returns to it. 

A more important fact related to the concept of an elementary length 
is that the metric of the Einstein-Minkowski space is indefinite. In the 
region of the manifold R, (x) in which pseudo-Euclidean geometry is 
applied without question, there does not exist any concept of closeness 
of two events P(x,) and P(x.) except that which is expressed as x? =0. 
This equation only expresses the fact that the points P(x,) and P (x2) 
may be connected by a light signal and that their spatial, r=|x.—x,|, 
and temporal, t=t,—t, separations may be of any size. The equality 
x? =0 “does not know” why it is fulfilled; whether it is because, although 
both the quantities r and ¢ are large, r=t, or because r=0 or t=0 [24]. 


t Equations of the type (39.5) were considered in [80, 81]. For the case N > 1, problems due 
to the fact that the energy-momentum tensor of the particles becomes infinite were dis- 
covered. 
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We now assume that the principle of relativistic causality is satisfied 
at least approximately. We also assume, first of all, that this applies to 
the point (a) which restricted the propagation speed of a signal to a value 
less that the speed of light. With such an assumption, the function 
G (x?, e, n) no longer has to vanish outside the light cone and so condition 
(39.3) is no longer satisfied. If we remove supposition (a), then the func- 
tion G(x’, e, n) does not vanish in the spatial region. In this case an 
interaction in which it is possible to distinguish between directions on 
the path y= +1, and therefore not conserve parity, becomes possible. 

The departure of G(x?, e, n) from conditions (39.4)-(39.4”) does not 
have to be very large. For large time intervals ta where a is the ele- 
mentary length, Eq. (39.4’) must be satisfies to a high degree of accuracy. 
We introduce the elementary length into the expression for © (x?, e, n): 


2 
6=6 (x & N, z) (39.9') 


and we consider that for |x?/a?|— 00 


2 


2 G £, n, =) > G(x’, 8), (39.9") 


where G(x, e) is the Green’s function from causality theory (i.e., it 
vanishes outside the light cone). This correspondance requires that for 
|x?/a?|>1 the influence function © (39.9) becomes vanishingly small. 
In fulfilling this requirement, the signal which goes from the point 
P(x) to P(x) will propagate with a velocity v=(x,—x,)/(t,—t,)=x/t 
which is practically equal to the velocity of light c=1. 

In fact, from the condition 


|x?| = |t? — x?| < a? (39.11) 


it follows that 
2 


a 
|1 — y?l < T (39.12) 


so that for |t|>a, v?=1. This condition is relativistically invariant. In 
fact, in a reference frame that moves relative to the first along the Ox 
axis with a velocity u, we have 
x =y(x— ut), t=y(t—ux), y=(1— u? "?. 
(39.13) 
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Correspondingly, 


2 2 2 
magje oes) 
12 


27 Baw (39.14) 


where v'=x'/t'. If |x| and |\t|~a and |u| is small, then |x'| and |t’| are 
also ~a for ux 1. From (39.14) it follows that |v’| is close to 1 except for 
the point 1—uv=0. However, at that point 
t=0, x soei = w= (1 =u, 
u u 
so that |t’| and |x’| are small (on the order of a). If |¢’| and |x’|>a, then 
from (39.14) we have v? x1. 

Therefore, if we violate the propagation law for signals in the region 
|x|, |t| ~@, we may nevertheless conserve the restriction |v|<c=1 for 
|x| and |t|>a, i.e., for macroscopic scales [85]. 

The second causality condition (b) requires special consideration. Ac- 
cording to this condition, t, >t, so that e= + 1 [see (39.4)]. The difficulty 
lies in the fact that the difference between s= +1 and e= — 1 has mean- 
ing only inside of the light cone x?>0 and it cannot be formulated for 
Green’s functions which differ from zero outside of the light cone, i.e., 
for x? <0. Therefore, the limiting transition (39.10) cannot be realized 
if the retarded nature of the interaction is preserved. 

The origin of this difficulty is directly related to the fact that the metric 
of Einstein-Minkowski space is indefinite and that the concept of close- 
ness of two space-time points is lacking. We can surmount this difficulty 
only by taking into account the existence of matter as sources of fields. 

The essence of the matter is contained in the fact that, in order to 
define the closeness of two points P(x,) and P(x) in the Einstein- 
Minkowski space, apart from the radius vector (K=X2—X,, t2— t, =t) 
which connects the two world points P(x,), P (x2), we must have some 
other time-like vector n (this vector can be considered as unitary, n.=1, 
without any restrictions’). Using such a vector, we may form the in- 
variant (xn) in addition to the invariants x?, n?=1, and using these we 
may construct the positive-definite quantity (also invariant): 


R? =2(xn)? —x? > 0. (39.15) 


* We already used such a vector in considering causality [see (37.24) and (38.4)]. 
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In the coordinate system in which n=(1, 0, 0, 0), the invariant R is trans- 
formed into the quadratic form 


R =r +t, (39.15’) 


which is the positive-definite Euclidean distance between the points P(x,) 
and P(x,). Using this “distance,” the concept of closeness of the two 
world points P(x,) and P(x) may be defined as follows. The points 
P(x,), P(x2) are considered close if 


Rsa, (39.16) 
and distant if 
R>a. (39.17) 


This is the formal statement of the matter. The physical meaning of the 
time-like vector can be quite varied. We shall distinguish between two 
principally different classes of vectors n. In the first, (a) the vector n is 
related to the motion of matter located at points P(x,) and P(x,). For 
example, this vector may be identical to the four-dimensional velocity 
Ug of the source of the field Q which is located at the point P(x,), or to 
the velocity U, of the field reciever located at the point P(x,), or to the 
velocity of their center of mass Um. 

In all these cases, the vector n is related to the motion of matter that is 
located at the points P(x,), P (x2). We call the vector n internal in this 
case. Introduction of the internal vector n into the theory does not 
violate any of the relativistic relationships for free particles. It does not 
violate the homogineity or isotropy of space, and it does not result in any 
one coordinate system being preferred. 

The second possibility (b) is that the vector n selects out some prefered 
coordinate system. In this case we call the vector n external. Such a 
vector may be related to the selection of any coordinate system [105], 
for example, to the coordinate system that moves with the matter in an 
expanding Universe, or to the system that is related to physical 
vacuum (if this vacuum is considered as “ether’’). This coordinate 
system could be related to discrete quantum space [94, 104], or to the 
reference frame of the observer [91]. 


t This classification of possible types of the vector n was introduced in [69, 72]. 
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In concluding, we note that the Einstein interpretation of the variables 
x and ¢ looses its meaning in all acausal cases because, in the region 
|x|, |t|a, there is no signal which has a velocity c=1. However the 
finite dimensions of real particles themselves exclude the possibility of 
measuring the variables x and ¢ inside of particles and, therefore, place 
their physical meaning in question. The space-time description of particle 
structure may be no less of a fictitious construction than the classical 
electron orbits inside the atoms. 


40. EUCLIDEAN GEOMETRY IN THE MICROWORLD 


In this section we shall consider a model of causal relations which, for 
large distances, reduces to the usual causal relations that are based on 
the principle that the speed of light is constant in space, and for small 
distances, changes their properties to such an extent that the velocity of 
light can become imaginary. The first type of causal relation corresponds 
to the pseudo-Euclidean geometry of the Einstein-Minkowski space, and 
the second type, to Euclidean geometry of four-dimensional space. Such 
a model should seem neither too strange nor even too fantastic if we recall, 
for example, the peculiarities of nonlinear field theory. 

In Section 7 we showed that, under certain conditions in the regions 
of a large field ọ and large gradients of this field p=09/0t, q=0@/6x, 
the characteristic equations of the field can become imaginary. In this 
situation there arises what we have what we called the “lump” of events, 
the set of realities which are related to each other but do not result from 
each other. In other words, such realities cannot be ordered in time. The 
concepts “earlier” and “later” do not exist for them. Note that if an ele- 
mentary particle cannot be decomposed into constituent particles which 
could be joined by a light signal, then there is no basis for assuming that 
the causal relation inside of these elementary particles, or inside of close 
groupings of them formed in collisions, will be the same as that which is 
characteristic of events which are separated from each other by distances 
which exceed the elementary particle dimensions. 

In the following we shall consider the particle groups which, in 
experimentalists terminology are called ‘“‘events” or “stars”. Each such 
“event” or “star” is characterized by its rays which mathematically are 
described by the momenta p, of the particles which enter and leave (for 
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a star with N rays, k=1, 2,..., N).* These momenta (vectors) may be dis- 
placed parallel to themselves within the bounds of the complex, so that 
they all originate from the same point, for example the center of mass. 
We call such points the “focus” of the events. 

The existence of a focus violates the homogeneity of vacuum, and 
the vector p, introduces a specific direction which violates its isotropy. 
However, we shall assume that the “star” as a whole may be displaced in 
space and time to any value, and rotated to any direction without chang- 
ing anything inside of it. This means that we consider vacuum itself as 
homogeneous and isotropic, and that it is only the existence of the “stars” 
that de facto introduces a special point (the “focus of the star”) and 
direction in it. 

We shall now formulate these ideas mathematically. In modern field 
theory, particles are described by the functions s(x,, x2,..., Xy) in the 
expansion of the matrix S in terms of the operators of the field g(x) (see 
Section 29). The matrix element of such a coefficient function 


«Pw; Pn—-1> +--+» PN—m ls (x1, X25 +045 xXy)l PN-m-1> PN-m—29--+ P2 Pi> 
= [dry -ataista, X2, .--, Xy) EXPi(PyXy +: 


+ PN-mXN-m — PN-m-1XN-m-1 Z0 P11) (40.1) 


is equal to the probability amplitude of particle transitions from an initial 
state, characterized by the momenta p,, P2, -.-» Py—m—1, tO a final state, 
characterized by the momenta Py -ms -<-> PN- 

The functions s (x1, X2, ..., Xy) are translationally invariant. In order 
to express this characteristic explicitly, we introduce, in place of the 
coordinates x4, X2,..., Xy, the new coordinates which are related to the 
old by the linear transformations 


X, = Axt (æ=1,2,3,4), (40.2) 
æ= n (k=1,2,..., N), (40.2) 
Ee atxi («= 1,2, 3,4), (40.3) 
yak =0 G212.2N=—D: (40.3’) 


k 


t We broaden the concept of N-ray stars here to include neutral particles in the set of rays 
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The operation of translation T (d) 


T(d)x=x+4+d, (40.4) 
by use of the conditions (40.2’) and (40.3’), gives 

T(d)X=X+d, (40.5) 

T(dE=E. (40.6) 


In this manner, under the translation 7 (d), the space R, (X) is displaced 
by d, and the space R(&) goes over into itself identically. 

For the homogeneous Lorentz transformation, these spaces also trans- 
form independently. Let us carry out the Lorentz transformation x = Lx, 
where X are the new coordinates, in (40.2) and (40.3). The transforma- 
tions can be written in greater details as 


xt = Lygxh, (40.7) 


where L,, are the matrix elements of the matrix Ê. From (40.7), (40.2) 
and (40.3), we obtain 


N N 
X,= $ ALX = Lag } A*xh = LagX g (40.8) 
k=1 k=1 
and 
. N . N f? . 
či = J aby, = Lap yee SLs (40.9) 
k=1 k=1 


or, in short notation, 

XK=Lx, =ë. (40.10) 
In other words, under Lorentz transformation, the spaces R, (x) and 
R, (č) transform independently, each inside of itself. If we now perform 


the transformation of (40.2) and (40.3) on the variables x,, x3,..., Xy, 
then the function s turns out to depend only on the variables ¢: 


S(X4, X25- Xy) > S (Či, Fa, -- 5 én-1)> (40.11) 


i.e., it is defined in space by the relative variables of R4y_4(6). 
Furthermore, this function which is itself invariant under Lorentz 
transformation, depends only on the scalar products 


T* = EE og cick (i,k = 1,2,...,N—1). (40.12) 
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where 


_ f0, for af 
Joss“ 4 4 -1,-1,-1 for a=8 


so that we may write 


S(G15 625-065 ČN-1) = S( Čo ++) 
=s('?, P3,..., Fe...) 
= s(..., I*, ...). (40.13) 


We must remember here that of all the N(N—1)/2 invariants 7*, only 
(4N — 10) are independent. 

Thus the function s is defined in the space R4y -4 (€) and it is a function 
of the scalar products 7*™*. We now assume that the causal relation 
characteristic of local theory changes in the vicinity of elementary par- 
ticles. As was explained in Section 39, in order to define the concept of 
vicinity, the isotropy of space-time must be violated and this may 
formally be realized by introducing the function s(€,, &,..., ¢y—1) of 
time-like vectors n. Such a possibility exists in real “stars” because, by 
the very factof its existence, the star violates the isotropy of space and 
time. In particular, any of the vector n, (nf =1, myo >0) which define the 
direction of the star’s rays, 


n, = Pk , for k we 1, 2, 3, EN N, (40.14) 


Vo 


may be taken for the necessary vector n. We could also use their linear 
combinations — for example, the vector defining the direction of the 
motion of the star’s focust 
P N 
n= x PSY ee (40.15) 
JP i= 
Taking into account the anisotropy of space-time which results from 
the anisotropy of the real stars, we may generalize the coefficients of the 
function s(é,, €5,..., éy—1) so that their dependence on the directions of 
the world lines is made explicit — for example, 


S(S15 S25 -> Cn— 15 Mis Aas e Mw—1) (40.16) 


t Compare with Sections 37 and 39. 
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or, for a more special case, 


S(61, 2, -+- ČN-13 n). (40.17) 
Thus the following new invariants are introduced 

RË = 2 (në? — E72 > 0 (40.18) 
and 

R* = 2 (nč') (ng*) — Ee (40.19) 
or the nondimensional invariant 

` R 
a 3” (40.20) 


where a is the elementary length. Conserving the relativistic invariance 
of “events” as a whole, using the vector n, we may assume that the func- 
tion a (č, &,..., Čy-1; n) depends on the invariants R*(¢, n) (40.18) and 
(40.20) as well as the invariant /“* (40.12). We select this dependence in the 
following manner. Replacing the scalar product (40.12) by a more general 
one, we set 


I* = a(z) I* + b(z) R*, (40.21) 
where 
z = 46 + OM), (40.22) 
and the invariants /‘* and R* are defined by (40.13) and (40.19). 
The relativistic invariance of the coefficient function s(€,, €3,..., ČN-1; 
n), when the invariants /* are replaced by the invariants J‘, is fully con- 
served. However, macroscopic causality may be violated. To avoid this 


happening, we select the functions a(z) and b(z) in such a manner that 
for z-0, 


a(z)> 0, b(z)>1, (40.23) 
and for z> œ, 
a(z)> 1, b(z)> 0. (40.23’) 


Furthermore, note that by “vicinity of an elementary particle”, we 
mean that the quantity |x;—.x,| is small, and 


Ix; — xl <a. (40.24) 
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In the general case, this condition does not correspond at all to the con- 
dition that the invariants (40.18)(40.20), which define the smallness of 
the quantity |é'|, be small 


č <a. (40.25) 


These conditions may be brought into agreement only by special selec- 
tion of the variables €;, these being 


E,=Xy~—X, bg =X2—X3, Ent 
= Xy_-1 — Xy. (40.26) 


In satisfying conditions (40.23), (40.23’) and (40.26) for z=, we 
obtain the usual causal relation because, in this case /i*1"*. However, 
for z0, J'*R** so that the usual causal relation is completely de- 
stroyed and in place of a succession of events, we have their “lump”, 
which is not ordered in time. In this latter case, in the coordinate system 
in which n=(1, 0, 0, 0), the invariant J‘* has the form 


ik — Rik = = čiek (40.27) 


which is characteristic of Euclidean geometry. A new symmetry arises in 
this case. For a given n, for small ||, |¢5| we have four-dimensional, 
spherical (or elliptical) symmetry. If all the scalar products of the vectors 
ći, ¢*,... were defined in terms of the invariants J'*, then we could say 
that in the space 9R (é) there is a special geometry which is such that, for 
|€;|>a, it is pseudo Euclidean, and for |č;| <a, it is Euclidean. 

The result of measuring the distance between two points A and B in 
such a space would depend on the scale on which this length were 
measured, whether it were larger or smaller than the elementary length. 
This paradoxical situation should not be suprising, however, because 
small values of the coordinate č are unobservable. Since we restricted 
ourselves to satisfying the conditions of macroscopic causality in discus- 
sing the physically observable matrix elements of the scattering matrix 
(40.1), this paradox is hidden rather deeply. 


41. STOCHASTIC GEOMETRY 


A basic assumption of stochastic metric geometry is that, to each pair of 
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elements P, Q of stochastic space R(P), it is possible to associate a dis- 
tribution function Fpg(z) which can be interpreted as the probability 
that the distance r(P, Q) between the points P and Q is less than z. 
This function has the following properties 


1) Fpg(z) = 1 for all z only when P = Q; 

2) Fpg(z) = 0 for all z < 0; ia 
3) Fpo = Fop, symmetry; 

4) if Fpọ(z)= 1 and For(u)= 1, then Fpr(z + u)=1. 


These properties reflect the usual properties of distance in the metric 
space Rp (P): 


1) r(P, Q) = 0 if and only if P = Q; | 
3) r(P, Q) =r(Q, P); | 
4) r (P, Q) + r (Q, R) > r (P, R). 

This metric follows from the stochastic metric as a special case if 
Fpo(z) = H [z — r (P, Q)] (41.2) 

and 

H(z)=1 for z>0, (41.2’) 
H(z)=0 for z<0 (41.2”) 


(see [86, 87]). From a physical point of view, the stochastic metric leads 
to stochastic arithmetization of space. In fact the elements of the space P 
are arithmetized in a manner such that, to each element P which marks 
some specific physical event, a number x(P) (coordinate) is assigned 
which is obtained by measuring the distance r (P, 0) from some arbitrary 
element (0) (the coordinate “origin”). 

The stochastic nature of the distance r(P, Q) leads to the fact that if 
we measure 


r(P,0)= L, (41.3) 


then the actual distance / between the elements P and 0 is equal to L to 
the probability 


dW (L, I) = w(L, J) dl. (41.4) 
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Therefore, the arithmetization of the spatial manifold becomes stochastic 
and each measured coordinate x(P) of the element P may be assigned 
an actual coordinate ¥(P) only to some degree of probability. 

In the following, by stochastic space we shall mean that space ®,,(X) 
which is mapped stochastically into the space of the measurable co- 
ordinates R(x), and we introduce the probability 


dW (X, x) = w(X, x) dx. (41.5) 


This equation differs from (41.4) only in that the distances L and / are 
assigned + signs so that they must now be considered as coordinates of 
the point P which is measureable and existent. If our space is uniform, 
then the probability (41.5) is a function of only the difference X — x. In 
this case, in place of (41.5), we may write 


“ays 
a 





dW (X, x)= w( r (41.6) 


a 

where a is some scale length which is characteristic of the dispersion of 
the individual values of the stochastic coordinates x about the mean value 
x=X(P). 

A number of papers [88-92] have been written on stochastic space. We 
do not intend to. give a detailed analysis of these papers in this place. We 
can summarize their conceptual content in the assumption that the “true” 
coordinates (x, f) of a physical event P, because of some principal 
indeterminacy in the instrument used to measure x, t, or in the event P 
itself, can be defined only to some degree of probability. By this we mean 
that if we imagine an ensemble of laboratory assistants who all have 
identical, ideal instruments for measuring the coordinates of an event 
P, they would all obtain different results for x=x(P) which would be 
distributed statistically about the average value x= X(P) 


> X —x\dx 
z= x1) = f )Z, (41.7) 


a a 





with the characteristic statistical dispersion equal to 


E E TET 
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with the normalization condition 


PER). (41.9) 
a a 


Going into greater detail, it is possible to give at least three possible 
cases which lead to a stochastic space R, (X): 





(a) the ideal instrument for measuring distances has an irremovable 
indeterminacy Ax~a; 

(b) the objects “particles” with the aid of which points in space are fixed, 
have finite dimensions a, which leads to Ax xa; 

(c) the medium in which the signal used for measuring propagates is 
turbulent so that the signal velocity fluctuates. 


The main difference between the physical theory of stochastic space- 
time and the mathematical theory discussed in [86, 87], results from the 
fact that physical space has an indefinite metric, whereas the metric 
described by the distribution Fpg(z) (41.1) is assumed to be positive 
definite. Because the metric of physical space-time is indefinite, Eq. (41.6) 
cannot be generalized to real space-time simply by increasing the number 
of dimensions. In fact the invariance of the probability dW of the 
Lorentz transformation requires that the probability density w be a func- 
tion of an invariant. The possible invariant is the interval 


s? = (X — x)? = (T — t}? — (X — x). 


However, the probability 


2 4 

dW = »(5) = (41.6) 
a) a 

is not normalizable and cannot represent the distribution of the stochastic 

variable x( P ) which is concentrated about the point P (and not about the 

light cone s? =0). 

In order to obtain a concentration of the distribution about P, we must 
have a time-like vector n as well as the four-dimensional vector (X — x). 
It is possible to construct an invariant that we met earlier, I (n, X— x), 
with such a vector [see (39.15)], and to write the positive-definite distance 


Ras 
R=+4+./2(n, X — x} —s? 30. (41.10) 
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The probability dW may now be written as a function of this distance 


R\ d*x 

dW (X — x, m= (E). (41.11) 
a a 

This probability may be concentrated about the point X(P) and it may 

limit the value of the rms deviation 


eo R\ d* 
ae [x -n E) Seer. (41.8’) 


Inserting the vector n into the equation for the probability dW, we again 
face the problem of choosing this vector (see Section 39). In accordance 
with the possibilities (a) and (b)? given in that section, we shall first 
discuss the former. 

(a) The vector n is tied to the reference frame of the observer so that 
the direction of the vector n corresponds to the direction of the world 
line of the observer (compare with [50, 91]). In this case averaging physical 
quantities which depend on the stochastic coordinates x of the world 
point P will lead to results which depend on the observer. 

Let P(x) be a physical quantity, a function of the stochastic variable 
x=X-—6€. Its observed average value will be 


F(X) = F(x) = fræ + é) w(é, n) dté 
= | F (Q) e'2* w(Q, n) d*O, (41.12) 


where F(Q) is the Fourier transform of F(x), and @(Q, n) is the Fourier 
transform of the probability œ(¢, n). The vector n has components 
n=(1, 0, 0, 0) for all observers. Therefore, the quantity @(Q, n) will not 
be invariant under changes in coordinate systems. This is particularly 
evident in the following physical reasoning. Let F(x) be a plane wave. 
In one reference frame, it may have a frequency w such that wa <1 (ais a 
measure of the dispersion of the stochastic coordinate x) and, therefore, 
its mean value F(x) is practically the same as its true value F(x). In the 
same reference frame but for a second observer, because of the Doppler 
effect, the value of wa may be >1 and, therefore, |F(X)|<|F(x)|. 


t We shall return to possibility (c) in Section 42. 
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Thus in a theoretical scheme which is defined by a vector n that is tied 
to the observer, special care must be given to quantities whose invariance 
is not in doubt. Such quantities are, for example, the total probability 
of forming a specific number and type of particles. This quantity could 
depend on the reference frame only in the case that a special reference 
frame exist (for example, the system related to physical vacuum, taken in 
the old sence of some type of medium such as “‘ether’’). 

(b) We now turn to the second possibility in which the stochastic 
nature of the coordinates x( P) depends on the particle dimensions being 
finite. In this case a physical event P is identified by the presence of par- 
ticles with finite dimensions a at this point. The coordinates x(P) can 
now be stochastic because it is not possible to define which point inside 
the particle (to within the limits of 4x~a) scattered the signal, as if 
it were not localized accurately in space-time. 

We can imagine that space-time is filled with particles which do not 
differ from each other, as is assumed in the Lagrangian method of 
describing fluid motion. In this case the time-like vector n in (41.11) 
can be identified with a four-dimensional particle velocity u. Relativistic 
invariance of the probability dW (X — x, n) under changes from the ref- 
erence frame of one observer to that of another will be observed because 
the vector u now satisfies the Lorentz transformation. The assumption 
that the variables u and č =X —x commute, 


[u, €] =0. (41.13) 


is implicit in the assumption that the probability depends on these 
variables. Therefore it is natural to consider the variable č as one which 
describes the internal structure of a particle. 

In particular, in reference [89] the rest mass of a particle m is as- 
sumed to be represented by the operator m which depends on the internal 
coordinates č and the four-dimensional velocity vector u of the particle. 


That is, 
2 


Med =m t al aig Cag) * bem | 
m(é, u) =m +& PAEIT "aE, + ČpČu — (Up w) i 
(41.14) 


t This assumption of accurate localization of a signal is equivalent in this case to tne 
assumption that the instrument used to measure the particle coordinates is ideal. 
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By changing to the system in which the particle is at rest, n=(1, 0, 0, 0), 
it is easy to show that the metric in the space R(¢) is the Euclidean 
metric I(¢, €)=€ 6, +626, + €363+64€4.1 The wave function of the 
free particles which have a momentum P has the form 


Wp (x, E) = pp (E) e'®*" (41.15) 
Placing this in Dirac’s Equation (14.1) gives 
Liy,P, + m(E, P)] op (č) = 90, (41.16) 


where m(€, P) is the operator (41.14), in which the speed u is expressed 
in terms Of the total momentum u= P/(P?)!/?. This curious theoretical 
scheme has, however, not been investigated fully yet. 


42. DISCRETE SPACE-TIME 


In this section we shall analyse the ideas concerning discrete, “quantized” 
space-time. The concept of “quantized” space-time itself is not suf- 
ficiently well defined. We shall use this concept here in relation to 
theoretical schemes in which some discreteness of space-time is assumed 
which does not result from any material motion and has a purely geo- 
metric character. In its simplest form, the idea of discrete quantization 
of space-time is contained in the assumption that the points in a space- 
time manifold do not form a continuum, but represent some discrete, 
denumerable, or even finite set. In particular we can assume that it is 
represented by points on some lattice (see [94—97]). 

In assuming some discreteness to the space-time manifold, we en- 
counter two problems, namely, (a) the problem of transforming such a 
manifold from one coordinate system to another moving one (relativity), 
and (b) the problem of the limiting transition from discreteness to the 
continuum (correspondence). 

These problems may be illustrated by an example of a four-dimen- 
sional manifold D, (x) in which each point X is defined by the coordinates 
xı =Ma, X,=n,a, X3;=n3a, t=va where nı, nz, n3 and v are whole 
numbers and the length a is the grid constant. Let (x, t) be a field that 
describes some physical reality which can propagate in the space D, (x). 


t This metric, which is based on condition (41.13), was derived in [93] ın which relativistic 
gyroscope theory 1s treated. 
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We use the word “propagate” here to mean that (a) there exists some 
relationship between the value of the field y at a point x(x, t) and at its 
neighboring points, and (b) we assume that it is possible to divide the 
manifold D,(x) into space, D3(x), and time, D,(t) (see Section 3). 
Furthermore, we assume that the field y is linear, and using “close” to 
mean neighboring points, i.e. points with the coordinates 


Xı + a, X2, X3, É; X1, X2 £4, X3, t; 
Xis X2, X3 ta, t; Xis X2, X3, t +a; 


we may write the equation for y as 


a (x4, X2, X3, t) 

+ By [Y (x; + a, x2, x3, t) + Y (x; — a, x2, x3, t)] 

+ ba [W(x1, X2 + a, x3, t) + Y (x1, x2 — a, x3, t)] 

+ B3 [Y (x1, X2, X3 + a, t) + Y (x1, X2, X3 — a. t)] 

+y [Y (x4, X2, X3, t + a) + Y (x1, X2, X3, t — a)] = 0. 

(42.1) 

If the manifold D,(x) is homogeneous, then the coefficients a, B,, $2 
B3, y do not depend on x and ¢. If it is also isotropic in the variables x,, 
X2, X3, then 8, =f, = 3. Separating out time, we assume that the signs of 
the coefficients p and y are opposite. Then Equation (42.1) may be written 
in the form 


(a+ oB + 2y)W + Bl Ay + 4oW + 43] + 74 = 0, 
(42.2) 


where 4,, 43, 43, 4, are the second finite differences of the variables x,, 
Xz, X3 and t respectively. By definition, 


Aw HW. x, +4.) W(X) a, n) — QW... Xe) 
(42.3) 


where the dots denote the arguments which do not undergo the displace- 
ment +a. 

In order to abbreviate our studies of Equation (42.3), we shall restrict 
ourselves to the special case of «+ of +2y=0, B = —y so that in place of 
(42.2), we have 


A.W + 4. + 43 — Ay = 0. (42.4) 
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This equation has eigenfunctions that have the form of plane waves 
y = const. ef 0-9, (42.5) 


where the momentum IT(/7,, H, I) and energy e are related by the 
dispersion relation for these waves, which is obtained by substituting 
(42.5) into (42.4) in the form 


I I IT 
sin? (2) + sin? (24) + sin? (=) — sin? (=) =0. 


(42.6) 


Equation (42.4), its solution (42.5), and the dispersion relation (42.6) all 
have the symmetry of a cubic lattice in the space R, (IT) and, therefore, 
they are invariant with respect to the group symmetry of this lattice. Thus 
transformation in one of the planes (J7,, €) just interchanges one of the 
coordinates x, and time t, and one of the momentum components I], is 
simultaneously interchanged with the energy e. 

At first glance, these discrete and finite transformations do not have 
anything in common with the Lorentz transformations. However, for 
\Ta| <1, the dispersion relation (42.6) has the form 


M? + 1? + 113 — è = O (Ia’), (42.7) 


where O(ITa*) denotes the remainder that is of the order of (ITa)*. The 
finite differences in (42.4) may simultaneously be replaced by the second 
derivatives so that in place of (42.4), we have 


Oy y y ay 
et Oe Oe Oe 
Thus for sufficiently small momenta |I|, Lorentz invariance still holds. 
For larger momenta, |II| ~ IT/a, the situation changes essentially and only 
discrete transformations, characteristic of the lattice, remain valid. 
Disregarding such a correspondence with the usual theory, we cannot 
avoid the existence of a preferable coordinate system in this theoretical 
scheme, this being the system of the lattice D,(x). This system has the 
peculiar feature that the usual relations of the special theory of relativity 
are valid for particles that have a small momentum in this system. 
We now turn to another approach, to the idea of a discrete space- 
time based on the coordinate x, as the displacement operator x, in a 


= 0 (Ha) y. (42.8) 
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curved momentum space R, (p) [98]. The most direct way of construct- 
ing such operators goes back to the concept of bounding the square of 
the momentum?: 


P=- (42.9 

where a is some characteristic length for this theory (Planck’s constant h 

and the velocity of light c are taken to be equal to 1 here). Condition (42.9) 

physically means that the square of the mass of the particle is bounded; 
m? =1/a?. 

We now introduce the homogeneous coordinates no, N1, N2, N3 and n4 
such that 


1 
nee” (u = 1, 2, 3, 4). (42.10) 
a No 
In terms of these coordinates, Equation (42.7), for the limiting values of 
p, becomes 


na — ni — ni — n3- ni =0. (42.11) 


Together with this quadratic form, we consider the five-dimensional 
hypersphere 


na — nt — ni — 13 — n =- a? (42.12) 


a 


and we shall define the operators X, as operators of infinitely small 


rotations which leave the hypersphere invariant. These operators are 


0 ô 
s= iaf- nz) «= 1, 2, 3, (42.13) 
ôn, ĉno 
0 0 
Na No. 


where X, =? is the operator of time. These operators may be supplemented 
by the angular momentum operators 


ô 0 
E ye By = 1, 2,3, 42.14 
Ê, (w R Z) a, P, y (42.14) 


t We may also consider a second limitation, namely p? > —1/a? (see [100]). 
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ô ĉ 
M= ino =} a=1,2,3. (42.14) 
ôn, M4 
It is easy to show that the nontrivial permutation laws give 
Pes te) = ia? L, (42.15) 
[X4, %,] = ia?M,, (42.15) 
[Ra Bp] = i (Sag + a7 Bap), (42.16) 
[X4, ĝa] = i(1 — api), (42.16’) 
[Ro Ba] = ia? PaPa. (42.16”) 
In the same manner, from (42.14) and (42.14), we obtain 
L, = Xb, — £, Pg, (42.17) 
M, = Saba + Saba. (42.17) 


i.e., the operators Ê, and M, are components of the usual four-dimen- 
sional angular momentum operator. In a similar manner we can also 
obtain the operators X,, ¥4=?7in the momentum representation 


ô 0 
2, =i | — + a?p,p 2), a= 1,2,3, (42.18) 
(=; "Ôp, 


ŝ, =i (2 — apap, =): (42.18’) 
These operators act in the space R,(p) in which the restriction (42.9) 
holds. The eigenfunctions of these operators which satisfy the require- 
ments of being finite and single valued, are obtained from the equations 
XW, (p) = x, (p), (42.19) 

ty, (P) = tY, (P), (42.19') 


where x and ż are the characteristic values of the operators % (we dropped 
the subscript «) and ?. It is easy to show that 


W,.(p) = exp [im arctg a (p)], (42.20) 


a(p) = ap, ./1 — a?p?, (42.20’) 


and m is an integer. 


"Ey 
lll 


where 
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In a similar manner, we obtain 


W,(p) ~ exp [itf (p)], (42.21) 


where 


Posie, (42.21’) 


/1 = a?p? 
Thus the spectrum of characteristic values of the variable x is discrete: 
x=ma,m=0, +1, +2, and the spectrum of the variable t= rta is con- 
tinuous. Such a ‘quantized’ space was first obtained by Snyder [98]. 

The geometry of the space R, (p) in which the operators discussed 
above and their eigenfunctions are defined may be completely represented 
by the geometry of a manifold of points which arise as a projection 
of the hypersphere (42.12) from its center onto the plane nọ = +a [99, 
100]. Such a three-dimensional case is shown in Figure 45. By making 


























p p 
H. ans H An 
p 
>n No 
H’ H 
p p 


(a) (b) 


Fig. 45. (a) Projection of the hypersphere 73 — n} — n= —a? on the plane no +a. HH and 

H’H' are two branches of the hypersphere; pp is the plane ny) = +a; and n’ are two points 

on the hypersphere whose projections on the surface pp are the point p [in the space R, (p). 
(b) The same projected onto the plane (1), no). 


such a projection, it is easy to show that we arrive at Equation (42.10) 


wheret 
a 


/1 i a’ p? 
t Other possible versions of the geometry of the space R, (p) may be obtained by displacing 
the center of the projection along the no axis (cf. [100]). 


No = (42.22) 
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From (42.10) and (42.12) we obtain the equation for the square of the 
length element in the space R,(p): 


do? = (1 — a2p?)"![dp? + (1 — a*p*)"'(pdp)*], (42.23) 
where 

dp? = dpå — dp? — dp} — dp3, (42.24) 

p dp = p4 dp4 — Pi dp, — P2 dp2 — p3 dps. (42.24') 


The volume element corresponding to this metric is equal to 


dQ(p) = ./ — g dpa dps dp2 dp; 
= (1 — a*p?)*/? dp, dps dp. dp,, (42.25) 


where ./—g=(1—a?p”)°/? is the determinant constructed from the 
metric tensor g,, which is defined by (42.23). The operators for X,, £4, 
Ê, M, introduced earlier are Hermitian in the space R,(p) with the vo- 
lume element dQ(p). 

We may now define the operation of translation in the space R, (p). 
Translations in the space ®,(p) represent rotations of the hypersphere 
(42.12). Therefore, these translations do not commute. Displacing the 
vector p to the vector k [denoted by p(+) ky] is not identical to displacing 
the vector k to the vector p [denoted by k(+) p].' The transformation of 
the wave function g(p) in the space R,(p) for infinitely small displace- 
ments of the space to the vector k may be written as 


p(p(+) k) = 1 — ix, k,@ (p). (42.26) 


The calculations show that the operators £, in (42.26) correspond to the 
operators (42.18) and (42.18’), as is expected from the fact that displace- 
ments in the space R,(p) are equivalent to rotations in the space R, (n). 
The fact that momentum addition is not associative causes the usual laws 
of energy-momentum conservation to become approximations in 
Snyder’s system. 

Practical applications of this refined geometrical theory to quantum 
field theory has led to serious difficulties [101, 102]. These difficulties are 
related to what are called the angular divergences which arise in the 
Feynman diagram calculations in momentum space. These divergences 


t For details, see [100]. 
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are not important in the ordinary theory because they may be removed by 
rotating the integration path in the plane p,, p = |p|,' through z/2. Under 
such rotations, the indefinite metric in the plane (po, p) changes to the 
definite, Euclidean metric. In the case of Snyder’s geometry in momentum 
space, the only modification that arises in computing the Feynman dia- 
grams results from the fact that the factor (—g)'/? appears in the volume 
element (42.25). From (42.15) it follows that this factor has a branch cut in 
the complex plane (po, p) for p? =p% — p° = 1/a?. This branch cut does not 
allow a rotation through z/2. In references [101, 102] attempts were made 
to first perform the rotation, then to introduce Snyder’s metric and then, 
using analytic continuation, to return to the physical region of the in- 
definite metric. Such an approach proved successful in certain cases.* 


43. QUASI-PARTICLES IN QUANTIZED SPACE 


An original equation, equivalent to that of Tomonaga-Schwinger but 
more convenient for use in curved momentum space, was formulated in 
reference [104]. 

Let g=nx be a space-like plane, where n is a unit time-like vector 
(n?=1), directed towards the future (ny >0). According to (28.14) we 
have . 

. dU (o, — œ) 
: do 7 


where Ñ (øo) is the interaction energy (in the “interaction representa- 
tion”).* According to (28.18), the S-matrix is equal to 


W (a) Ô (0, — œ), (43.1) 


$=1+iT=O0(+a, — %0). (43.2) 

We write the operator U(a, — œ) in the form of the integral 
dt 4 . 
z Re”, (43.3) 





> 1 
O10, -o1 | 
2n JT 


t Note that the analytical properties of the matrix elements ın the usual, local theory allow 
such rotations. 

* Reference [103] must also be mentioned here. In this work, a geometry of momentum 
space with variable curvature is used. The difficulties discussed here may perhaps be 
overcome in such a geometry. 

t A=1 in (43.1). 
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where R(nt) is some new operator and zt is an invariant parameter, a 
conjugate of o: e+ +0. From (43.2) and (43.3), for a + œ, it follows 
that 


§=1+iR(0), (43.4) 
ROO)=T. (43.5) 

We now set 
Fio) = | W (nt) el dr (43.6) 


and substitute (43.3) and (43.6) into (43.1). It is easy to show that this 
equation will be satisfied if the operator R(nt) satisfies the condition t 


+a 


R(nt) = W (m) + = | Wn — nt’) 
2n 


, 


Rut’). (43.7) 





t — ie 


= 
We shall consider the special case of a self-interacting scalar field p(x) 
with a cubic interaction * : 


W (x)=g:0(x}:. (43.8) 


Assuming 


@(x) = Ons? | eik*6 (k) d*k, (43.9) 


we obtain, using (43.6), (43.8) and (43.9), 


5 g 3 
W (nt) = —— foton — k, — ky — k3): (kı) 
J2n 
x õ (kz) ĝ (ks) :d*k, d4k, d*k,, (43.10) 


where the vectors k,, kz, ka denote the particle momenta. 
Integrating (43.7) gives the operator R in the form of a series in terms 
of the interaction constant g: 


R (nt) = 2 R,(nt), (43.11) 
s=1 


t A more direct and detailed derivation of (43.7) is given in Appendix XV. 
+ This special case 1s easily generalized and is only given here for simplicity. 
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where 
A 1 A dt a 
R, (nt) = (Qn! | WM) ae, W (nt, wR nt) Kore 
dt,_ z 
TREE, E a (43.12) 
T— T- — ie 
Rewriting this equation in normal form gives 
3s 
Ry(nt) = Y | 84 (ne — ky — ky == km) 
m=1 
x Rents kı, kz, e km): Ø (ky) @ (k2) es 
x Ö(km): d*k, dtk, +- d+k,,. (43.13) 


Here the functions S® are integrals of the products of the functions 
A, (k)=0(k4) 6(k? —m?) and g(t,)=1/(t—t,—ie). Because of the pro- 
perties of the factor A, (k) in this theory, all particles in the intermediate 
states are real particles for which k?=m? and k?>0. The virtual par- 
ticles in the intermediate states that are characteristic of the usual 
perturbation theory are expressed in this new treatment by the appear- 
ance of additional integrations over the variables t4, t2,..., T,-1, that are 
contained in the function g(t,). These additional integrations must be 
taken into account in constructing diagrams to represent the functions 
R,(nt). 

The necessary generalization of the Feynman diagram method can be 
carried out as follows. We compare the continuous outer lines, as usual, 
to the operators of the field @(k), and for t #0, we complete their outer, 
dotted lines in such a manner that the conservation law 


ky t+ ky +-+ k, —nt=0. (43.14) 


is satisfied. We compare the solid inner lines to the functions A, (k), and 
we represent the propagation function by dotted lines. These lines join all 
the vertices. Some of the simpler diagrams are shown in Figure 46. 
Note that in going over the physical S-matrix, we must set t=0. 
Therefore the simple diagram (a) does not describe a real physical 
process, because it would describe either the disintegration of one of the 
particles into two, or the opposite process of synthesis, and both of these 
would violate the energy-momentum conservation laws. The two dia- 
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AT 
ky At -q ni 
/ 
k A ke 
aT q 
ae 
3 gq 

Ay k2 

k2-ATG|-9 
(a) (b) (c) 


Fig. 46 Some of the simpler diagrams in the theory of quantized space. (a) The vertex for 

three real particles k,, k3, ka and one “quasi-particle” At; (b) diagram of the proper 

energies of particles taking into account the possible emission or absorption of “quasi- 

particles” At; (c) diagram for the scattering of real particles k,, k2, k3, ką; At is the 
“quasi-particle”. 


grams in Figure 46b are diagrams of the characteristic energy. Integration 
over the variables q and t is assumed in these diagrams. The more com- 
plicated diagram in Figure 46c describes particle scattering. On the whole, 
the complete mechanism of real-particle interactions can be described as 
an exchange of real particles in which “quasi-particles” with momenta 
NT, NT,...,NT,-1 (dotted lines) particinate, and the variables t4, t,..., 
T,-, have values between — œ and +œ. 

It appears that all divergences in these calculations arise in the 
integration over these variables, i.e., over the momenta of the quasi- 
particles in the intermediate states. Thus the disagreements may be 
avoided by restricting the region over which the variable 7 is integrated. 
This may be done by changing over to the curved p-space and assuming 
that the component points of the momentum are not situated on the direct 
path from + œ to — oo, but ona circle of radius A/a, where a is some ele- 
mentary length which relates the distance between diametrally opposite 
points on the circle. In this case the integration is restricted to within the 
limits 

th th 
—— Ims. 
2a 2a 
For a—>0, the points lie on the direct path and the limits of integration for 
t extend from — œ to +0. 

The group of translations along the direct path, and the group of 

translations on a circle on which the diametrically opposite points are 


(43.15) 
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identified, are isomorphic. The differences in transformation lead to 
substantial changes in the addition of momenta which, on the circle, are 


2nh 
Pi(+)p2 =p, + pz to modulo =e (43.16) 


In other words, momentum addition has to be carried out within the limits 
—h/a, +nh/a. Then Equation (43.7) may be rewritten in the form 
+n 


2 = 1 5 
Rin) = Wor + f W(n | t(—) 7) 
2n 


-n 


, 





—— R(nt’). (43.7) 
T — 1€ 


From this equation we obtain the unitarity condition 

R(0) — R* (0) = iR(0) R* (0) (43.17) 
and the relationship between the matrices 7 and Ñ; 

T=RO, (43.5’) 


Other relationships, characteristic of the theory for a=0, may be similar- 
ly obtained. 

However the transformations in going over to coordinate space change 
in the obvious way. All functions of the variable t in this new scheme must 
be periodic functions of t with a period 2xh/a. Therefore, the fourier 


transform in 7, i.e., along the vector n, must have the form 
+nħja 


o= | Toa, (43.18) 


— nħja 
This transformation has meaning only in the case when a spatial variable 
o=nx is added to t, where this variable is discrete and has values which 
are multiples of the elementary length, namely, 


0, + a, + 2a, ..., + ma. (43.19) 


In this manner, the assumption that this new group of transformations 
exists leads to a quantization of space-time in the direction of the vector 
n. On the basis of the new law of addition (43.16), the inner dotted lines 
may represent real quasi-particles with momentum 2zh/a. Thus the dia- 
gram in Figure 46 for nt = 27h/a will represent a real process in which qua- 
si-particles with large momenta take part. These processes are analogous 
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to the well-known “umklapp-processes” in crystal lattice theory in which 
momentum conservation for particles that move in the lattice has mean- 
ing only to modulo 2zh/a (where a is the lattice spacing). The momentum 
2nh/a is transferred in this case to the infinitely heavy lattice. The usual 
energy-momentum conservation law does not have any meaning in this 
scheme. This is because, when the vector n assumes the form of a discrete 
lattice, time in the direction n becomes nonuniform. Physically, this viola- 
tion may be interpreted as an absorption or emission of quasi-particles 
with momentum 2zh/a. 

The scheme described above of a quantized space-time posesses sim- 
plicity and elegance. A fine point which requires special attention is the 
dependence of physical quantities (momenta of real quasi-particles and 
scattering matrix elements) on the unit vector n. This vector specifies the 
direction in which space-time has a nonuniform structure, i.e., the special 
direction in which the isotropy of the space-time continuum that is 
characteristic of Einstein-Minkowski geometry, is violated. This situa- 
tion is similar to that of the unit time vector n with which the localness 
of acausality is ensured (see Sections 36 and 37). 

The following three interpretations are possible: (a) the vector n is 
“external” (see Section 38). For example, it may be related to the motion 
of the hypothetical “ether” or galaxies [2]; (b) the vector n is a vector 
characteristic of the observer so that it has the value n=(1, 0, 0, 0) in all 
reference frames. In this case we have the situation discussed in Section 
39; (c) the vector n is an arbitrary unit, time-like vector. In this case 
the square of the matrix elements T,;(n), the quantities | T,;(n)|?, would 
be proportional to the probability that, in the process f—i, quasi- 
particles, both real and virtual, having momenta parallel to the vector n, 
take part. 


44. FLUCTUATIONS OF THE METRIC 


In the regions of space in which the motion of matter is turbulent and is . 
accompanied by significant, random changes in the density or velocity of 
matter, the metric tensor g,, (8.1), becomes a random quantity. Similarly 
the temporal, tpg, and spatial, xpg, intervals which seperate two events 
P and Q become random quantities. In this case it is possible to speak 
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only of the probability that tpg =t and xpg=1.' 
We assume that the energy-momentum tensor of matter may be 
written in the form of a sum 


Ty = To + OT, 


pv? 


(44.1) 


where T9, describes the total motion of matter characterized by large time 
periods T and large scale lengths L, whereas the additional term ôT, 
results from turbulent material motions and is characterized small time 
periods t < T and small scale lengths å < L. The average value of 67,, in 
time intervals ~T and scales ~L is taken to be equal to zero 


dT, = 0, (44.2) 


where the bar denotes the average over turbulent motions. We expand the 
metric tensor in a similar manner: 


Guv = Ius + yy, (44.3) 
hp = 0. (44.3’) 
By using linear approximations for the quantities 67,, and h,,, we 


obtain from Einstein’s metric tensor Equation (8.3) an equation that 
holds for small 67,,, and h,, (see, e.g., [13] or [14]), namely, 





— 07h = Alyy, (44.4) 
where 
2 
2 — gt 44. 
O go ôx, ax,” ( 5) 
tav = Ody = tae. oT, (44.6) 
ôT = gi? OT, (44.7) 


“x= 8ny/c?, y=6.7 x 1078 cm? gm™! s~? is Newton’s gravitational con- 


stant. From Equation (44.4), we obtain 
Nyy (x) =— 2x Eeth (x) , (44.8) 


where Qz? is the inverse of the operator [2 (the subscript x indicates 
that the operator acts on the variable x). Using this expression for hy, (x), 


t This section may be considered as a discussion of the third possible cause of stochastic 
nature of space and time which was noted in point (c), Section 41. We follow reference [106] 
in this section. 
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we write the statistical correlation of the quantity h,,(x), taken at two 
points in space-time x and x’, as 


hn (xX) hag (x') = 4x? O? OF? Cuvap (% X’), (44.9) 
where the quantity 
Cyvap (x, x’) =, tay (x) tag (x’) (44.10) 


is the statistical correlation of the tensor component ¢,, taken at two 
points in space-time x and x’. 

In Section 8 we showed that the space-time interval between two 
events P and Q is equal to 


P 
Spg -Í | + Guy dx, dx,. (44.11) 
Q 


For small h,,, we obtain 


P 
h (x) dx, dx, 
Sro= Sot | — M ods, f (44.12) 


5 V t Gey dx, dx, 


where spo is the value of the interval spg for h,, =0. From (44.2) and (44.8) 
it follows that 








ASpo = Spo = s?o = 0. (44.13) 
Furthermore, from (44.12) the equation for the rms Asko =(Spg—Sg)” is 
P P 
—- dx, dx, dx, dx; 
ASfo = | | £ P__hy (x) Ingg(X’) 
V E go dx, dx, y/ £ gag dx, dy (44.14) 


or, substituting the expression for the correlation, (44.9) into this 
equation gives 


o dds | dx, 
Asžo = 4x? 
J JE 2, dx, dx, Gay Ax, dx, 


a: dx, 


_ + g% dx, dx, 


Or OR? Cpag(X x’). (44.15) 
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In the special case when the direction of PQ is along one of the coordinate 
axes, then Equation (44.15) simplifies to 


Xp(P) 


— d 
Fe ie | e 
Xu (Q) V £ Fun (x) 
x'a (P) d : 
xf FS? Noel). (44.16) 


/ 0 r 
x'u (Q) E Iuy(*) 


From this equation we see that the problem is reduced to one of com- 
puting the correlation Caunu (x, x’) and the quantity 


M suua (X x) = wO: Og Cua (x, x’). (44.17) 


We now turn to the microworld. In the microworld, the statistical 
nature of the metric may be a result of field fluctuations in a physical 
vacuum. The existence of such fluctuations, just as the fluctuations of 
electric charge in vacuum, may be considered as proven on the basis of 
the discovery and measurement of the Lamb-shift of the electron level in 
hydrogen.t 

In order to avoid unnecessary complications, we shall consider the 
fluctuations of a scalar field w(x) in which the particles have a rest mass 
of m. The Lagrangian in this case has the form 


sed ih 0@ 66 
L=—| gt — — — mô? }, 44.18 
-( Ox, OX. IRR ) ( ) 
and the corresponding energy-momentum tensor is 
2 0g 0 R 
| a E JE (44.19) 


ax, ôx, Havis 
where ĝ (x) is the scalar field operator and products of the form (d@/0x,) 
(6g/éx,) are assumed to be normalized (see Appendix VII). 


The vacuum average of 7;,,(x) is equal to zero — i.e., 
<0| Ê (x) 10> = 0, (44.20) 


t The possibility of this was first shown by the author (see [143] and [144)). 
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and so in this case 67, = T,v. Using (44.6), we find it easy to show that 
> (gy = £01) 26 (x) 








= — gah: 4. 
u (X ax, ax” Juv (4 21) 


Therefore, 


O@ (x) 6@ (x 
M punu (% x) = oro (| a ~Soun?6*eo) 
u H 
06 r 60 , 
| @ (x’) @ (x’) tome) :). (44.22) 


T2 
I + 
Ox, OX, 











In computing this quantity we must remember that the vacuum average of 

the products of the normal products is defined by the equation 
<0]: Ê (x) Ô (Y): : Ô (2) Ê (u) : 10> 
=—A_(x—2z)A_(y—u)+4_(x—u)4_(y—2), (44.23) 


where 4- (x) is a singular function that is defined in Appendix XIII.‘ 
From (44.3) it follows that 


o 00 (x), 00) 460) Ji 6A? (x — y) 
` Ox, Ox, dy, Oy, ôx, OXg Oy, Oy, 
(44.24) 





If we place the function 4- (x) by its Fourier transform A_(p) we obtain 
eipx dtp 


= SS 44.25 
p? +m? ( ) 


4- (x)= [4wer et -Í 
C 
(the contour C is shown in Figure 49), and using (44.23) and (44.24), 


from (44.22) we obtain 





xh? ( A_(p) 4_(q) ee *® 
M yuyu ($) = & 2 | 4 
c (p +q) 
x {p242 + guum? (pz + q2) + g3 mt} d*p d*q, 
(44.26) 
where €=x’—x and « is a numerical coefficient of order 1. It is easy to 
show that this integral diverges at the upper limit |p,| |qu|» K—>œ. For 


t In order to obtain this equation, all particle production operators had to be transposed 
on the left of the anthilation operators in the product of the fields p(x) pọ (y) @(z) ọ (u). 


260 SPACE AND TIME IN THE MICROWORLD 


č =0, it reduces to 


xh i 
M auuu (0) = B e K+, (44.27) 
and for €=t, Kt>1, to 
xh? K? 
M unuu (t) =y ee e cos Kt. (44.27') 
c 


where f and y are numerical coefficients. Because of this divergence, there 
is no point in calculating M „u44 (x) in detail. However, Equation (44.27) 
and (44.27) are useful in the sense that they show that the fluctuations 
in the metric tensor are due to fluctuations in the vacuum field and are 
important only in regions with scales of the order of 


1/K = A,, (44.28) 


where A, is the length 


h 
A, = =Z = Ar, = 0.82 x 10-32 om, (44.28') 


where A, =h/mc is the Compton wavelength of the particles and r, = xm is 
the gravitational radius of the particle. It is significant that the fluctua- 
tions do not depend on the mass of the particles of the vacuum field 
insofar as this mass m<h/A,c~ 107° gm.* Field fluctuations on a scale 
of the order of A,=h/mc give a negligible contribution to the metric 
tensor. 

From (44.28’) we can see that the critical scale A, is the geometric 
average of the gravitational radius of a particular r,, and its Compton 
wavelength h/mc. The length A, is significantly greater than the grav- 
itational radius of the particle, but it is less than the length that is char- 
acteristic for weak particle interactions, Ar ~1071° cm. Thus if weak 
interactions should play the defining role in the structure of elementary 
particles, then gravitational phenomena are either in general insignificant 
in the world of elementary particles, or they are buried at such depths that 
they can be reached only in dreams. 


t See reference [107] where ‘“‘maximon” particles with limiting mass m, = h/A,c are consider- 
ed. 
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45. NONLINEAR FIELDS AND THE QUANTIZATION 
OF SPACE-TIME 


In this section we shall consider the problem of quantizing a nonlinear 
field ọ (x, t) which is described by the Born-Infeld equation in two-dimen- 
sional space-time R2(x). This equation is obtained from the variational 
principle 


6S =0, s=-[Laxar, (45.1) 


where the Lagrangian is 
L=- (1 + ø- o)’, (45.1’) 


where 9,=0@/0x, ~;=0g/0t. This function is a special case of the 

Lagrangian that was considered earlier in Section 7. The scale of the field 

which is a measure of its nonlinearity is taken to be unity in (45.1’) 
From the variational principle we obtain the equation 


(L= 97) Pax + 29,990 — (1 — 92) Pu = 0, (45.2) 
where 
ao eo oo 
Pxx = 72? Px = a? Pu = T 


For po—>œ, this corresponds to gx, #:—>0, and (45.2) reduces to the 
linear wave equation 


Pxx — Pu = 0. (45.2') 


If we set z= p(x, t), then this equation may be considered as the equation 
of the minimal surface in space having the metric ds? = dt? —dx? —dz?. 
In fact the integral S in the variational principle is equal to the area of 
such a surface. Note that the equation which results from the Lagrangian 


L=-(1+ o +95)", (45.3) 


would give the minimal surface in the usual Euclidean space with the 
metric ds? = dx? + dy? +dz?. A soap film stretched on a given contour is 
an example of such a surface. 

At first glance, the problem of quantizing a nonlinear field does not 
appear to have anything in common with quantization of space-time. 
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However, as was shown in [108, 109], there is a direct relationship 
between these problems.' In order to demonstrate this relationship, we 
turn to Cauchy’s problem for Equation (45.2). We take the initial con- 
ditions for =0 to be 
p(x, 0)= a(x), — @:(x, 0) = B(x) (45.4) 
and assume that this function satisfies the condition that it be hyperbolic: 
1 + a? (x) — b?(x) >0. (45.5) 


In order to solve Cauchy’s problem, we introduce the variables «, 8 which 
are analogous to the variables a=x—t, p=x+t that are characteristic 
of the linear equation (45.2’), and we consider the variables x, t and z=@ 
as functions of the new variables a and £: 
x= x(a, f) t=t(a, P), 
o = ọ (t (a, P), x(a, B)) = z (x, P). (45.6) 
Introducing the vector r=r(«, $) with components t, x and z and 
defining the scalar product of rı and r2 as 


(r1r2) = tit2 — X1X2 — Z122; (45.7) 


we can rewrite Equation (45.2) in the form 


12 Dog — 2 (targ) Dag + 17D oa = O, (45.8) 
where 
r ôr ôr a?r e’r (45.8) 
a= Mg=ap la = ap laS ha an : 
ôx f ô ôa? aa AB 
and D,, is the determinant 
Coat Xpo. ijs 
Doo = |ta Xa Za (45.8”) 
tp Xg Zg 


Here the subscripts p and o have two values, « and £. The condition that 
the equation be hyperbolic gives 


(arg)? — rêr > 0, (45.9) 


t We follow these works in the following. 
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and the characteristics of Equation (45.8) are given by 
r? =0, r =0. (45.10) 


Equations (45.10) and (45.8) together form a system of equations 
from which the three unknown functions t («, £), x(«, P), and z(a, B) may 
be obtained. From (45.10) and (45.8) we have 


D.p = 9, (45.11) 
and therefore 
Tug = Ar, + Brg, (45.12) 


where A and Bare arbitrary functions of « and $. From (45.10) and (45.12) 
we have 


1ô , 
(Tapta) = (r,r 6) B = 235 r, =0, (45.13) 

1ô , i 
(ragt p) = (1.8) A = a= 0 (45.13’) 


and as (r,, rs) #0, A = B=0. In this manner we arrive at the final system 
of equations 


r? =0, r} =0, rag = 0. (45.14) 
From the third equation in (45.14), it immediatly follows that 
r(x, B) = r, (x) + r2 (P), (45.15) 


where r; and r2 are two arbitrary vectors. The first equation in (45.14) can 
now be written as 


ar, (N? ôr (B)\? _ 
(= ) = 0. ( a ) =0. (45.16) 


Making use of the fact that the variables « and £$ are arbitrary, we select 
them in such a manner that for r=0, «= £8 =x. Then the initial conditions 
(45.4) can be written in the form 


t(x, x) = tı (&) +t, (4) =0, (45.17) 
x(x, x) = x, (4) + x2 (a) =a, (45.17) 
z (x, a) = 2, (a) + z2 (x) = a (a). (45.17”) 
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We now write the derivative ĉọ (x, t)/ôt in terms of derivatives of t, x 
and z with respect to « and £: 






































Xa Za X4 (a) z(a) 
do Xp Zg x2(B) 22 (B) 
— = nml (45.18) 
Ot |X, ty Xi (a) ty (a) 
Xg tg x2 (8) t5(B) 
and for «=f we obtain 
xı (x) zı (a) 
dp X2 (a) 23 (a) 
= = b(a) = m, 45,18’ 
Be 8 a ee) eee 
X2(a%) tz(a) 
Further, from (45.14) for «=, we have 
tı (a)? — x4 (a)? — 24 (a)? =0, (45.19) 
ty (a)? — x (a)? — z} (a)? =0. (45.19’) 
We now write the desired result in the form 
B 
r(x, B) = 3[(p(a) +p] +3 | m(A) då. (45.20) 
From (45.17)-(45.17"), it follows that the vector p(a) is equal to 
p(x) = [0, a, a(a)], (45.21) 
and the vector n (a) 
n (x) = [7 (x), 2, (a), 2, ()] (45.22) 


is defined from (45.12), (45.19) and (45.19’). A few calculations lead to the 
result 


1 + d (a) — d (x) b(a), 
TS caer coca” 
N b(a) (45.23) 
n (a) = — TL’ 


where 
L(a) = [1 + æ (a)? — b (a)?]"?. (45.23’) 
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The quantities z,, 7, and z, have simple physical meanings. For t=0 


ôL 
z, = T(x, 0) = ae (45.24) 


t 
is the momentum which is the canonical conjugate of the field g. Then 





OL 
n, = G(x, 0) = — 7 (x, 0) a’ (x) = z (x) 3 (45.25) 
is the momentum density of the field ọ and finally 
ðL 
= H(x,0)= 30, g, -L=n7,9,—L (45.26) 


is the energy density of the field. on (45.20) may now be written as 








t(a, B) = tfma [H (4) — 1] då, (45.27) 
eee a fou di (45.28) 
9 2 2 > 
j B 
z(a, B) =4[a(a) + a(B)] +3 | n(A) da. (45.29) 


a 

From these solutions it follows that the variables ¢, x, and z, when 
considered as functions of « and £, depend also on the dynamic variables 
H, G, n, taken on the spatial surface t=0. The dependence of t and x on 
these dynamic variables is a result of the fact that in the case of the non- 
linear equation, the characteristic lines are not straight lines a=x—t, 
B=x-++t, but are curved, and this warping depends on the magnitude of 
the field ọ and its derivatives Px, Pr- 

Equations (45.27)(45.29) may be rewritten in the form 


é+t 

tét) =t} | [H (4) — 1] da, (45.27) 
g-t 
FT 

xë t =čé+} f G(A) dA, (45.28’) 


gmg 
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itt 
z(č, 1) =}[a(ë +1) +a(E—D] +4 | zoa, (45.29') 
é-t 
where 
pitt, a=g-1, t=- g= 
(45.30) 


We shall now consider the field y(é,1)=z(é, t) as the Heisenberg 
operator @(¢, t) in the space R,(, t). For t=0, the quantization law 
for scalar fields now gives 


h 
[ô (é, 0), #(€, 0)] = 7 (8 = 8). (45.31) 


The quantization of the field @ leads to the quantization of the quan- 
tities £ and x which become operators. Then x and ¢ no longer commute 
with the field @. In order to show this, we form the Poisson brackets 
[Ð (č, 1), ¢(¢’, t)] and [@(€, 1), x(¢', t)] and compute them to the first non- 
vanishing terms of order 1/@. In this approximation, the operators H(a) 
and G(a) in (45.27’), (45.28’) and (45.29) may be taken in the linear 
approximation 


G(y = âl), (45.25') 
Ou 
Pasis (2) ‘| 45.26! 
(a) — =z + n(x)’. (45.26') 
Then we obtain 
gr 
1 : 
[Ê (é, 1), t(&, t)] = o | [a (č +1) + a(¢ — 1), H(A] då, 
E (45.31’) 
3 1 7 
[(é, 1), x(č', 1)] = 402 | [a(é + T) + a(č ga 1), G(4)] då, 
Po 
eSt 


(45.32) 


where G(a) and A(a) are taken from (45.25’) and (45.26’), and the dimen- 


CAUSAL RELATIONSHIPS AND GEOMETRY 267 


sion of the equation is easily obtained by introducing explicitly the scale 
of the gradient of the field @o. 

Using the commutation law (45.31) and noting that a(é +1) is obtained 
from a(¢) by simply displacing the coordinates ë by +1, we obtain 


[A (E, 1), t (č, 1)] 








Ete 
Lh 
=z | r0 B= E+ 91+ 0-9) aa 
po 1 
stan 4 
h 
=z E+ +a- o), (45.33) 
[Ê (é, 1), x (¢', 1)] 
EFT 
1 h ĉa (à) 5 
=r f E [a = + 0} + STA — E- oN} dà 
cost 
_ 1 hfaag+r)  da(E—1) 
-i a ta } i as 


where the right-hand sides of (45.33) and (45.34) have to be set to zero if 
č lies in the interval 


&-—w< F< E42. (45.34) 
If two physical quantities can be represented by linear Hermitian 
operators A and B and if their commutator is Ĉ =[A ; Bh, then it is well 
known that the rms deviations 4A4?, AB? of A and B satisfy the in- 
equality 
AA? AB? > 4|C?| (45.35) 
in any state (see, e.q., [110]). On the basis of this generalized uncer- 
tainty relationship, from (45.33) and (45.35), we obtain the following 
relationships for the rms values Ag’, 4t? and Ax’. 





pe tee os 2 
Ag (1)? At(é, 1}? > am Iné +t) +a =T, (45.36) 
0 











h? \ép(é + 1) 4 ôe (E — 1)? 


2 2 
Ag (č, t)* Ax (č, 1) > Tot a ae 





(45.37) 
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From this it follows that for t >Q, there does not exist any specific value 
of the field ọ at a given point with coordinates x and t. The field ọ is not 
defined in the space ®, (x, t), but in the space R, (č, t). Because of this, 
for t >0, there does not exist any point in the space R, (x, t) which would 
correspond to a given point in the space R; (č, t).' 

From the commutative laws it is also evident that “quantization” of 
the space-time R, (x, £) can only arise in the regions of the space-time 
R(é, t) where the nonlinearity is large. For 


Es m(€) > 0 and zl ge > 

Po Po OF 
the variables g, ż and x commute because ¢ and x become C-numbers 
again. The example of the nonlinear field discussed in this section is 
interesting in that the quantization of space-time is not introdiced 
externally, but it results from the dynamics of the nonlinear field. 


0, 


t In the approximation we considered, however, that x and ¢ do commute. 


CHAPTER VIII 


EXPERIMENTAL QUESTIONS 


46. CONCLUDING REMARKS ABOUT THE THEORY 


The main idea of this monograph is based on the assumption that the 
S-matrix, which asymptotically describes microphenomena, will retain 
its meaning in future theories. This assumption does not imply that the 
methods of constructing this matrix will not undergo great changes and 
differ significantly from present-day methods, be they based on the tradit- 
ional concept of Lagrangian interaction of local fields, or those based 
on the more abstract concepts characteristic of the axiomatic approach. 

A second basic assumption used is that modern free-particle theory is 
valid. In other words, we assumed that modern theory asymptotically 
correctly reflects physical reality for large separations between particles. 
These assumptions served as the basis for considering possible de- 
viations from present-day theory in regions with very small scales in space 
and time. We considered two main classes of possible modifications to 
modern theory which, by convention were characterized using the vectorn 
“internal” or “external” (see Section 39). 

We now turn to the first class. The current concept of uniformity and 
isotropy of space-time is conserved in this class. Moreover the asymptotic 
momentum space R, (p) (see Section 31), and the space-time R, (x) are 
conserved in the form in which they are familiar to us in modern theory. 
The possible deviations from modern local field theory themselves have a 
local character, i.e., they are concentrated in small regions of space-time 
(with volume ~at where a is the “elementary length”) in the neighbor- 
hood of the interacting particles. Using the schemes we had considered 
for local violation of microcausality (see Sections 36-40), we attempted to, 
describe a “magic” circle — a region in space-time within which the si- 
tuation could perhaps be described using only an essentially new concept 
that arises from the limits of our present-day concepts. Thus the theoret- 
ical schemes we considered may be considered as models. This model 
character is expressed in the fact that, when we localized acausality, we 
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used the spacetime description with the variables x and t which are 
defined for free-point particles. The “equality in rights” of these variables 
for small separations in space and time is subject to question. 

However the logical consequence of these models is supported by the 
fact that we used the space-time description only as a device for guar- 
anteeing that macroscopic causality would not be violated. Then we 
returned to momentum space and in this manner constructed an acausal 
S-matrix which was similar to the original causal matrix $. The fact that 
the acausal schemes we considered may be treated as models makes the 
experimental verification of facts which could be used to show the inad- 
equacies of the usual local theory expecially important. 


47. EXPERIMENTAL CONSEQUENCES OF 
LOCAL ACAUSALITY 


An important consequence of local acausality is the changes that occur in 
the analytic properties of the local scattering amplitude T (œ). These, in 
turn, cause changes in the dispersion relations and those between 
asymptotic cross sections (see Section 30 and 38). These consequences of 
acausal theory can be investigated experimentally. In order to be able to 
verify the dispersion relations however, they must contain only ex- 
perimentally observable quantities. 

The dispersion relation for the forward elastic-scattering amplitude 
for the scattering of x-mesons off of nucleons has these necessary proper- 
ties. Therefore, only this type of scattering is suitable for experimental 
verification of such an essential part of the theory as microcausality. The 
corresponding dispersion relations were given in Section 30. Experiment 
and theory can only be compared for the dispersion relation for charged 
mesons, i.e., for the amplitudes D,(w) and D,(w) [see Section 30, 
Equations (30.32’) and (30.32”)]. 

The point wo at which the computations are usually made is the meson 
rest energy, so that wo =m, D,(w)=D, (m). If the absorption part of the 
amplitudes 4, and A) in Equations (30.32’) and (30.32”) are expressed 
using the optical theorem [see (XIII.4)] in terms of the total cross sections 
c+ and o_, then these equations may be written in the form 
f? k? 1 
M œ? — (m?/2M)* 1 — (m/2M)? 





D, (w) = D, (m) + 
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k (do a , , 
+ m| k olo (c, (w) + o- (œ), (47.1) 


f? 2k? wM 1 








PaO = M ao? (m? 2M)? m? 1 (m/2M} 
Kw 7 dw 1 
4r f k EE (w) — O_ (a')). (47.1/) 


where k? =w? —m?, where k is the meson momentum and the constant 
f? =(g7/4) (m?/M?)=0.081 and is measured experimentally from the 
scattering of low energy z*-mesons. 

The biggest problem in comparing theory with experiment is that of 
extrapolating the total cross sections g+ (w) to very high energies. This 
extrapolation is necessary for the evaluation of the integrals in (47.1) and 
(47.1'). The extrapolation equation is of the form 


b 
O, =at (47.2) 


The constants a and b and the index n are selected experimentally (see 
[111-114]). The first experiments to determine the real part of the scat- 
tering amplitude for very small angles D(@) were made in Dubna [115, 
116]. The experimental data was analyzed and compared to the theoret- 
ical dispersion relations (47.1) and (47.1') using (47.2) in references 
[111-114]. 

The most complete set of data to date is given in [114]. Figure 47 shows 
the experimental data (dotted line) for the relationship between the real 
and imaginary parts of the amplitude «= D (w)/A (œ) as a function of the 
energy of the mesons k. The solid line is the theoretical curve obtained 
using Equations (47.1), (47.1') and the experimental formula (47.2). This 
figure shows that the dispersion relation obtained experimentally agrees 
well with the theoretical result obtained on the basis of local (causal) 
theory up to values of the x-meson momentum of k = 20-26 GeV sE 

We now turn to the additional terms that result from the local acausal 
amplitude (see Section 38). According to Equations (38.13) and (38.13’), 
we can write 


Dai (@) = D, (w) + Y, (Œw), (47.3) 
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Da2: (0) = Dz (a) + y2 (0), (47.3) 


where Da (wœ) and D,2(w) are the acausal amplitudes; the amplitudes 
D, (œ), D2(@) are given by (47.1) and (47.1’); and y, and y, denote the 
additional acausal terms. The necessary transformations of Equations 
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Fig. 47. The ratio « of the real and imaginary parts of the scatttering amplitude. The 
solid curves were calculated from the dispersion relations (47.1) and (47.1) 
using Equation (47.2). 


(38.13) and (38.13’) are carried out in Appendix XVI [Equations 
(XVI.11’) and (XVI.12)] and the result is 


w? — m? 
Yı (@) = ——_— f(a) (0, (M) + o- (M)), (47.4) 
! 4r, / M? + m? 
T o-—m_ w bie , 
Wr o) =- Vy gt Pl) Ds ( )+ D_(M)), (47.4) 


t Note that w denotes the meson energy in the laboratory reference frame. 
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where o+ (M) is the total cross section for œ = M, and D+ (M) is the real 
part of the amplitude at the point a=M. 

For w 2M, these corrections which were added to the real part of the 
scattering amplitude D(w) become very important and, therefore, they 
would be observed in experiments if the elementary length a=1/M was 
greater than 10715 cm. 

We now turn to the experimental verification of the equality of the 
total asymptotic cross sections (see Section 30). The total cross sections 
for pp, pp, z*p, K*p collisions from references [117, 118] are shown in 
Figure 48. From this figure, it is obvious that the equality predicted in 
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Fig. 48. The total scattering cross sections for the processes pp, pp, K?* p;x*p asa func- 
tion of momentum (from data ın [117, 118)]). 


local theory [compare (30.36), (30.37)] does not occur for particle 
momenta up to 20 GeV s~!. Whether this difference between experiment 
and theory is due to the fact that large enough particle energies have not 
yet been reached, or whether it ought to be interpreted as a deficiency in 
local theory remains an open question. Some of the curves (K *p) seem to 
indicate that the curves tend to different limits. Further investigation of 
the limits of the cross sections represents an important problem for future 
accelerators. 

Quantum electrodynamics is the classic example of a local theory. 
Therefore, in the past few years much effort was put into searching for 
deviations from the predictions of this theory. One of the more im- 
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portant ways of checking the validity of quantum electrodynamics is 
that of measuring the Lande g factor which defines the relationship 
between the magnetic moment of a particle and its spin. For the electron 
and u*-mesons, this factor is close to 2. The experimental data for 
u-mesons may be summed up by the Equation [121] 
a 2 
319 — exp = 319 — Auheor + 0,6 (5) , (47.5) 
where «?/hc is the fine-structure constant. The divergence given by (47.5) 
from the values predicted by local quantum electrodynamics may be 
formally stated in terms of changes in the -meson distribution function 
D,(q) (where q is the momentum transfered). We first replace the local 
function D,(q) by the nonlocal function D,(q, a) using the equation 


Š 1 ” 4 1 
D = —_ >D > = D, a . 
e(a) Paw (4, a) = D,() poe 


(47.6) 


Nonlocal nature is introduced into this equation by the last term which 
contains the small length a («u «1/m,,) where m, is the z-meson mass." In 
order for theory and experiment to agree, the length a would have to 
be <10714 cm. 

Experimental data show that the Lande g factor is 2 for electrons 
and this corresponds, to a high degree of accuracy, to the value for 
u-mesons. It is only in terms of the order of («/z)? that differences appear. 
However, in the region in which such high-order terms become important, 
quantum electrodynamics ceases to be a closed theory and strong inter- 
action effects must be taken into account. In the case when intermediate 
bosons exist, weak interactions must also be taken into account [121]. 
In investigating the validity of quantum electrodynamics, (e*e~) and 
(uu) pair production for large 1:omentum transfers were studied 
[121], as well as elastic scattering of electrons and mesons off of protons 
(with a momentum transfer q of up to several GeV/sec) [145]. The basic 
results of these experiments showed that the predictions of local quantum 
electrodynamics hold up to values of a< 107 !* cm. 

It is significant that within these limits, there do not appear to be any 


t This estimate was introduced ın [122] and ıs arbitrary because (47.6) can only be used for 
estimating the possible nonlocal effects. 
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differences in properties of u-mesons and electrons, not counting the 
great difference in their mass (m,/m, ~ 200) [145]. This fact indicates that 
the mass difference of the z-meson and electron must be related to space- 
time regions on the order of 107 +5 cm or smaller. Essentially this estimate 
of the scale of the nonlocal nature of a, which is based on studies of 
np- scattering given in [144], leads to values of a=1/M x10716 cm. 
Therefore, it becomes probable that the mass of the particle is defined, not 
only by strong electromagnetic interactions, but also by weak interac- 
tions which evidently play a fundamental role [144-146]. We recall that 
the length a that is characteristic of weak interactions (see Section 22) 
is equal to a= Arp z 107 1° cm. 

Modern local theory leads to the well-known CPT theorem according 
to which the use of the operations of charge conjugation C, spatial 
parity Ê and time reversal Î does not change the state of elementary 
particles. Formally, this is written as 


CPT =1. (47.7) 


The operation CP is called the operation of combined parities. This im- 
portant relationship between the three operations was already shown in 
the fact that in the Feynman diagram technique, antiparticles could be 
considered as particles that move in the opposite direction in time [127]. 
A deeper basis for Equation (47.7) lies in the proper relation of the statis- 
tics of a particle to its spin [128]. In the axiomatic approach to quantum 
field theory, the CPT theorem can be derived from local causality, from 
the positiveness of particle energy and Lorentz invariance [129]. 

If the CPT theorem were violated, this would imply that the basic 
principles of local theory had been violated and this could be related to 
possible changes in the principles of causality in the microworld and, 
more generally, in the geometry of the microworld.t However as we have 
seen, the invariance of the CPT theory has been verified to great accuracy. 
One consequence of this invariance is the fact that the mass of a particle 
m and its antiparticle m are equal. The difference in mass of the Kz- and 
Ks-mesons has been measured very accurately, and these measurements 
show that 


my — Mk 2 
—_—— |} < 10714. 








mk 


t Compare B. A. Arbusov. Preprint 71-26, J. Ph. H. E. Serpuhov (1971). 
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This result shows that Equation (47.7) is valid to a high degree of ac- 
curacy. 

Another of the consequences of the CPT theorem is that the lifetime 
of a particle t, and that of the antiparticle t are equal. The mass of 
unstable particles is a complex quantity ; m=m,+i//2 where À is the 
disintegration constant. Therefore the fact that t and 7 are equal follows 
from the CPT theorem in the same way as the equality of the masses 
m and m. The data obtained to date on the disintegration times of 
z*- and 2” -mesons gives 


+ 


*__ 1 = 0.064 + 0.069, 
T 


i.e., the lifetimes are equal to the degree of accuracy that is presently 
attainable [149]. 

Another piece of data that supports the validity of the CPT theorem 
is the Landé g factor, or rather the quantity (g — 2) for u~- and u* -mesons 
[145]. These give (g —2), =(1162+5) x 1076 and (g—2)_ =(1165 +3) x 
x 1076. Thus we may consider CPT = 1 for the time being. The combined 
partities CP pose a more complicated problem. A few years ago it was 
discovered that this parity was not conserved during the disintegration 
of long lived K,-mesons (K$-mesons) into two z-mesons [123, 124]. If 
K? denotes the K°-meson state and K° denotes the anti-K °-meson state, 
then the short-lived K-meson state (K9) is 


1 5 
Ks =— (K? + RS, (47.8) 
We 
and the long-lived state (K9) is 
1 x 
K, =— KORY. (47.8') 


The first of these states is even with respect to the operation ĈÊ, and the 
second is odd. On the other hand the state of a pair of mesons (x* 27) is 
even with respect to CP. In fact the parity inversion P changes the 
sign of the 2-meson state because it is pseudoscalar and it changes the 
direction of the radius vector which joins the two mesons. Because of 
this, the wave function which describes the state of the meson pair 
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acquires the factor (— 1)! (here / is the orbital momentum). Under charge 
conjugation C, the interchange ntn” takes place and, therefore, a 
second factor (— 1)! is required. 

In this manner the parity of the state (x*z~) is proved. Therefore, 


the disintegration process 
K >n +27 (47.9) 


is forbidden if CP-parity is to be conserved. In the experiment described 
in [123] in 1964, disintegrations of the type (47.9) were shown to be 
possible, even though the probability of this occurring is 

K >n* +27 

Ks >n +27 
Since then these results have been supported and recently a violation of 
ĈP-parity was discovered during the disintegration of a K-meson into a 
m-meson and leptons in the reactions [126] 


x2 x 107°. (47.10) 


K,>n +u*t +v, (47.11) 

K, ont +u +3, (47.11’) 
and 

K,>-2x +et+v, (47.12) 

K,ont +e +7. (47.12’) 


The asymmetry was discovered in the fact that reactions (47.11) and 
(47.12) appeared to be prefered to the reactions (47.11’) and (47.12). 
However, because of the symmetry of the K,-meson with respect to the 
operation CP, reactions (47.11) and (47.11’), and correspondingly (47.12) 
and (47.12’), ought to have occured with equal frequencies. The violation 
of CP-parity could either indicate that the CPT theorem is violated, or 
that the physical phenomena are not invariant with respect to time 
reversal operations T, i.e., the future is not symmetric with the past. This 
asymmetry is well known in macroscopic phenomena and in everyday 
life, but it is most unusual in mechanics or in field theory [130]. 

The extent to which CP-parity (and, consequently, the asymmetry of 
time) violation may be related to the problem of causality and geometry 
in the microworld depends to a great extent on the degree of generality 
of the violation of CP invariance.t So far no phenomena, except for 


t See [132, 133]. 


278 SPACE AND TIME IN THE MICROWORLD 


the disintegration of K-mesons, have been discovered in which CP- 
parity is not conserved. This fact hinders the investigation of CP non- 
conservation from the geometrical point of view. 


48. EXPERIMENTAL RESULTS OF MODELS 
WITH THE “EXTERNAL” VECTOR 


We now turn to the other class of models which, in our classification, 
we refered to as the class of models having an “external” vector n. In 
these models, in contrast to the local acausal case, Lorentz invariance is 
violated and it becomes in some sense an approximation. This departure 
from Lorentz invariance depends either on the selection of a coordinate 
system, as in some of the models of stochastic space (see Section 41), 
or on the violation of the homogeneity and isotropy of space-time which 
is characteristic of models with a discrete or quantized space-time (see 
Section 42). 

In Section 44 we considered zero-point fluctuations of the scalar field 
(x) which gave rise to fluctuations in the metric tensor g,,. In the usual 
understanding of vacuum, these fluctuations as well as those in the 
vacuum charge do not violate the homogeneity or isotropy of space- 
time because, at any point in the space-time continuum R,(x), they 
occur in an identical manner. However, we may also regard physical 
vacuum in the older way according to which the vacuum is a reference 
frame singled out in our Universe. Although these assertions may seem 
improbable, they may not be logically excluded from these considera- 
tions. 

In particular, it was noted in [105] that as the scale size increases, the 
number of reference frames which can serve as inertial frames decreases 
significantly. For particles moving within the atomic nucleus for ex- 
ample, the acceleration reaches values of 10°? cm s~ 2, for macroscopic 
systems, the value is of the order of 108 cms~?, for planets, 1 cm s7? 
and for stars or galaxies only about 1078 cm s™?. Therefore, although 
all macroscopic systems may appear to be inertial for atomic systems, it 
is difficult to find inertial frames for large systems, other than those of 
the stars or galaxies. Remembering the well-known expanding universe, 
it would be more correct to consider reference frames as locally at rest, 
but changing slowly as one goes over to more distant galaxies. It is 
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highly probable that the “most inertial” reference frame could be that 
which is at rest with respect to the “relic” radiation that was recently 
found in the universe. 

Let us now return to the microworld, and for simplicity consider 
elastic scattering. In the usual local theory, this scattering is completely 
defined by the scattering amplitude A(s, t) which depends on two in- 
variants, a= P? and t=q? (P is the total momentum of the particles and 
qis the momentum transfered during scattering). If there exists some time- 
like “outer”, unit vector n, then the scattering amplitude A may also 
depend on the two invariants «=a(n, P) and B=a(n, q), where a is the 
elementary length. Therefore, 


A= A(s, t, a, B). (48.1) 


We now assume that the laboratory reference frame is almost at rest 
with respect to some prefered reference frame (prefered because of in- 
homogeneities in space-time, “most inertiality”, etc.), so that in the lab- 
orabory frame n=(1, 0, 0, 0) and in the other, a=aE, B=agy. We shall 
now consider elastic scattering in this reference frame. In particular, if 
the center of mass of the colliding particles is at rest, then x=a[(E 9+ 
+m)m/2]''2, B=0 and m is the particle rest mass. Thus scattering in the 
laboratory frame and in the center-of-mass frame can be different. This 
difference can of course only become observable for very high particle 
energies (or sufficiently large momentum transfer). In the opposite case, 
axB~x~0 and A~A(s, t, 0, 0). 

But is such an experiment possible? Very accurate data exist for the 
reaction K?-+2* +n” for two energies. According to the local theory, 
the probability of the disintegration dW is a function of the two invariants 
s= P? and r= Pp, = P?— Pp,, where P is the four-dimensional momen- 
tum of the K$-meson and p, and p, are the momenta of the n+- 
mesons: 


dW =dWis,r). (48.2) 


This probability can be converted in the usual manner from one reference 
frame to another and in particular, if it is known for the meson at rest, it 
can be computed for moving mesons by simple recalculation of the 
invariants s and r. The criterion used to show the validity of such a trans- 
formation, based on the usual kinematics of the theory of relativity, is the 
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invariance of the rest mass of the K°-meson. Experiments performed at 
energies of the K$-meson of E,=1 GeV [134] and E)=10.7 GeV [135] 
show the spread Am, of the possible values of the meson rest mass m, 
in the first case is ~ 0.7% and in the second is ~ 1% for the same average 
value of m,. Thus these experiments show that relativistic kinematics in 
the region of a few GeV gives good agreement to an accuracy of ~ 1%. 

In the physics of the last century, conservation laws had a fundamental 
meaning. In present-day theory however, symmetry and group proper- 
ties have a more fundamental meaning. From this point of view, the 
conservation laws are defined in terms of symmetry. Thus the conserva- 
tion laws of energy, momentum and angular momentum are con- 
sequences of the homogeneity and isotropy of space-time. Violations 
of the homogeneity and isotropy would lead to violations of the con- 
servation laws. We recall here that in the general theory of relativity in 
which the inhomogeneous Riemann space is used, the local conservation 
laws of energy and momentum do not exist. 

It would be interesting to determine to what degree of accuracy and 
reliability the laws of energy and momentum conservation apply to 
the world of elementary particles and, in particular, to regions of high 
energies. This proves a difficult question to answer because the signifi- 
cance of these fundamental laws is assumed to be obvious and no ex- 
periments have been set up to verify them. One must remember that the 
possible violations of laws could be a result of violating homogeneity and 
isotropy in the microworld space-time, but there is hardly any basis for 
making the concept of homogeneity and isotropy of space-time a subject 
of faith in physics. 

And what information is it possible to obtain from modern high 
energy experiments? It turns out that the most accurate results are ob- 
tained from studies of elastic scattering of protons. The results show that 
the relativistic conservation laws of energy end momentum for elastic 
proton scattering 


Pı + P2 = Pi + Ph, (48.3) 
E(p,) + E (p2) = E (pi) + E (p2) (48.3') 
(where p, p2 are the proton momenta before impact, pi, p} are the 


momenta after impact, and E =(p?°+m?°)!? is the proton energy) are 
satisfied in the region E =2-10 GeV to an accuracy of about 3%. The 
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corresponding wavelength in the laboratory reference frame is A= 10715 
cm. Furthermore, we can consider as sufficiently well determined that the 
sharp discontinuous violations of kinematics (48.3) and (48.3’) do not 
exist at the level of 10%, but that they could exist at the level of 3% [136]. 
However, there are as yet no experimental data on this subject. 

If any inhomogeneity exists in space-time on the ultrasmall scale of 
order a, then the expected violations of the conservation laws would be 
significant, 4E~hc/a, Ap~h/a, although they could be rare. Such a 
situation exists in the theory of “quasi-particles” that was discussed in 
Section 43. Obviously, it would be difficult to distinguish between such 
violations and those which result from processes in which neutral 
particles take part. In the case when some momentum and energy is lost, 
it would be natural to interpret this as being probably due to the forma- 
tion of a neutral particle. Thus it would be more interesting to observe a 
spontaneous increase in energy and momentum under the conditions of 
sufficient shielding from external neutral radiation. 
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APPENDIX I (TO SECTIONS 14 AND 16) 


Singular Functions in Field Theory 


The function D(x) in (14.4) and (16.3) is derived from the four-dimen- 
sional integral 


a eee Lt 
(Qn) ke +m?” on 
where d*k=dky d°k, kx=kx—kot, k*=k?—k? for bosons. For fer- 


mions we shall use the vector p instead of k. 
The integrand in (I.1) has poles at the points 


ko =+, /k?+m=+0 (1.2) 


or for fermions, at the points kọ = + (p? +m?)!’? = + E. Therefore, the 
result obtained by integrating (1.1) over dk, depends on the choice of 
integration contour in the complex plane. The various possibilities are 
shown in Figure 49. The function D(x) corresponds to the contour Co 
in this figure. 

We now list the various singular functions that can be obtained from 
the integral (I.1) depending on the integrating contour chosen. 

1. The function D(x)— “commutative function” : integration contour 
is Cy and the function has the form 





= x SIN oF 
D(x) = np fe <a d3k 
= ein for sag ENE ee a}. (1.3) 
2n 2 m/x? 


where x? =t? —x? =t? —r? (with c=1), J, (z) is the Bessel function, and 


a(g=1, €>0, e(H=+1, €>0, 


O(é)=0, &E<0, e=-1, & <0. (1.4) 
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D(x) 











Caav 


D(x) Dret (x) 
Daav(x) 


Fig. 49. The various integration contours in the complex plane ky =u + iv that can be used 
to compute the singular functions D, De, Ds, Dre D 


ret? ““adv- 


2. The function D,(x) is “the causal function” ; integration contour is 
C. This function has the form 





= 1 2 m 2 2); 2 
D.(x) = zz 5(x’) aoe ) [ys (m./x?) — iN, (m /x?J] 
+ 0 (x2) —"— K, (m\/x). (1.5) 


4n? Je 


Here J, (z), Ni (z), K, (z) are the usual Bessel functions. For m=0, the 
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function D, becomes 
1 
D Le ôt 2 : ; 
(x) Tn (x*) (L6) 


where 


ôt (z) = 





. 1 1 
iœ (z + i£) d aar 
ak p 27 z + ie (Ey 


le 


for e>oo. Similarly 





2n z — ie 


1 m 1 1 
oa Z| vee do = — (1.8) 
0 


for «00. This equation is equivalent to 
il 
ôt (z) = o +- 11. (1.9) 


which is more useful for integrating along the real axis if the term in 1/z 
is the most important in the integral near z=0. 

3. The functions D + (x), which are obtained by integrating along the 
contours C* and C7, have the form 





D (x) = {D(x) + iD, (x)}, (1.10) 
where D, (x) is the function obtained by integrating over the contour C; : 
2 K 2 
BY PAIRING) e i. (1.11) 
2n? m, /x? 
D, (x)= 
2 N 2 
He NE) sips ao 1.11) 


27? mx? 
4. Lastly, the functions D,e (x) and D,4,(x) are obtained by integrat- 


ing along the contours C,e and C,4,. These are equivalent to the Green’s 
functions for the inhomogeneous equations 


(0O? — m°) Dra (xX) = — 5(x), (1.12) 
(I? — m’) Diay (x) = — ô (x), (12% 
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_ D(x) for t>0, (1.13) 
Dre (x) piei t for t< 0, (1.13) 
_ $0 for t>0, (1.14) 
Daa) = ae for t<0; (1.14’) 


D,.,(x) is the retarded Green’s function and D,a is the advanced Green’s 
function. 

The corresponding functions for a four-dimensional field are obtained 
by operating on the function D(x) with the operator (7,0/0x,—m). In 
particular, the commutative function for a fermion field S (x) is equal to 


ô 
S(x) =— (1,2 -m)ow, (1.15) 


The expression for the causal function for a fermion field is 


S. (x) = — ( m) D. (x). (1.16) 


Ya “Ox, 


Finally the retarded function S,,,(x) and the advanced function S a (xX) 
for the fermion field are 


ð 
Sret (x) =m (> ax, —= m) Dyer (x), (1.17) 
G + 
Saav (X) = — | Y, T m } Daay (x) (1.17) 
and they satisfy the inhomogeneous equations 
ô 
(n. 7A — m) Siei (x) = — ô(x), (1.18) 
fi 7 
(n rhe m) Saay (X) = — ô (x). (1.18) 
Xu 


The derivations of these equations may be found in textbooks on 
quantum field theory [37, 38]. 


APPENDIX II (TO SECTION 15) 


The components of the spinor u} (p) for E>0 may be written as 
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r=1 r=2 
u,=N, u =0, 
uz =0, u,=N, 
TI* II.1 
m Pz ge! he N (L1) 
m+E m+E 
H Pz 
——N, = ? 
Pe MRE Mente 


where 


1 m\ 1/2 . 
N = — (1+2) , I =p, + ip,- 


V E 


From this we obtain 


4 
du* ôu J už ðu, 1 ( =) m?p? 
= =-(14+— am 
Op. OP. op, Op, 8 E E 


a= 




















+3042) ali p; g 
2 E) E (1 + m/E)? E* 4 
m? pe 2p? mp? 





x — — 

(1+ m/EP ES (1+m/E) E (1 +m/E) E’ 
p "h | 1 (E? — m? — pz) pz 

(1+ m/E) E5] 2 ES 


(+2) (1 P] 1.2 
x E TSE R (II.2) 








Note that 
i pz dQ = $np’, fe dQ = $np*, (11.3) 
and so 
1 
L (au* du an? for p<mc, (1.4) 
M = — dQ = 
4n } Op. Op, 11 F 
3p for p>mce (IL4/) 


or 
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1 
a for Ek, (IL5) 

m 4m Po 

m(a =)= 1 4m 1 m 
eas * aif us I.5 
mon” FP ae 


The integral of M (&) has the form 


L, (=) = je (2) £? dë ZG z), L6) 
o 4 Po 


For m/p)— œ this integral is equal to 


ss oaf 2 (6) E2 dë = 
ne- (č) č keg (I.7) 
0 
and for m/p,)-0, to 
m/Po œ 
me (peng Ep 
B()=z5 E (&) č dé + z> 23 pe [s (¢) de. 


m/po 


(11.8) 


The first integral in (II.8) tends to zero as (m/py)°. The second integral 
in (II.8) tends to unity and gives a contribution of the order of (1/p,)? to 
I,. This is combined with the contribution from the integral J, in (15.9). 


APPENDIX III (TO SECTION 16) 


The example referred to in Section 16 is as follows. Let the field p(x) 
(16.13) have an amplitude A (k) that is equal to 


We compute w(x, 0). On the basis of (16.16) we have 


exp (— (k — k,)*/2b? + ikx) 
w 


d?k 





ọ (x, 0) =C | 
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aye | exp(— (k — k,)?/2b? + ikx) Ph 
w 


b2 2 
= {1 exp(- = + ik) 
2 
Co ( b?x? 
+ — exp| — 
%2 


where we assumed that |k; —k,|> b. Therefore, 








+ kx), (111.2) 


2,2 


b?x? i b?x 
dp =c; ep sika) +c, exp( - 





+ kax). 
(II.3) 
Using (16.24) we have 
w(x, 0) = 7 (ôp*p + p*ôg) 
=e hs? {iat + lesca cos (Akx + a) 


@, wi 


2 

cc c 

4 e102 cos (Akx + at, 
w 


2 W2 





(111.4) 


where 4k=k,—k,, a=arg(c,/c,). Assuming that w, > œw, we get 


C2 


2 
ow om erm EL 4 
Cy 


@, 








cos (Akx + a|. (II.5) 


For |c3/c,|> 1, the quantity w(x, 0) oscillates. Note that, because of the 
assumption that |k, —k,|>0, the density is not strongly localized (in the 
small). The oscillations have a period of the order of 1/|k.—,| in space, 
and of the order of 1/|@.—@,| in time. 


APPENDIX IV (TO SECTION 20) 


The Absorbing Diaphragm 


We investigate the solution of Dirac’s equation for a particle of mass m 
and energy E which moves in the field of a strongly absorbing diaphragm. 
We must assume that the interaction between the particles in the dia- 
phragm and those we are considering is covariant. For definiteness, we 
assume that this interaction is of a vector nature. 
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Let the fourth component of this interaction be V. Then Dirac’s equa- 
tion gives 


(op + Bm) y =(E— V)y. (IV.1) 


Assuming y = {el we obtain for g 


` 1 A = 
fei E-VEM a} e=(E-V—-m)¢@, (IV.2) 


where o is the Pauli matrix and 


= t x p= G E 7 (IV.3) 


In regions in which V =0 (outside of the screen but inside of the aperture) 
we have 


Po = (E* — m’) o. (IV.4) 


For simplicity we shall restrict ourselves to two dimensions, and we 
assume the beam is directed along the Oy axis. Noting that 


8? 0? 
2 Fes y2 Saf ee ate, 
p (= z) 


ọ can be written in the form 


ọ ~ exp(i(k,x + k,y)), (IV.5) 
E =k +k +m =k +m, (IV.6) 
E=+ Jk +m’. (IV.6’) 


In the region outside the screen, where V #0, Equation (II.2) has the 
form 

Pe +[(E-VyY —m]9=0 (IV.7) 
or 

Vo + [(E— V} — m7] 9 =0. (IV.7’) 


If the energy V is real, there are two possible cases. These are: 
(a) V<0 (attraction); in this case the quantity [(E—V)? —m?]=q? = 
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=q +q} >k? and the solution has the form of a wave 


p ~ exp(i(q,x + q,y)). (IV.5') 


For E<m bound states may occur. If V< —2m, then bound states in 
which electron and positron states become interchangeable can also 
occur. 

(b) V>0 (repulsion); in this case the quantity [(E — V)? —m?] is 
positive for sufficiently great values of E and so the solution may again 
be written in the form (IV.5’), but with a different value for the wave 
vector qx, qy. For (E—V)?—m?>0, the particle passes over the barrier. 
For (E—V)?—m? =/? <0, 42 +14? =/?, the wave function has the form 


p = exp(+ A,x + Ay), (IV.5”) 


Inside of an infinitely large barrier we must take the solution which 
corresponds to a damped wave 9. 

Finally, for large V (V >2m), the barrier once again becomes trans- 
parent. This corresponds to the overlapping inside of the barrier of levels 
m and —m, i.e., to a sufficiently large polarization of vacuum so that 
the field may form positron and electron pairs. 

We now turn to the case of interest in which the purely imaginary part 
of the potential is absorbed. Assuming V =iW we obtain 


V?ọ + [(E? — m?) — 2iEW — W°] 9 =0. (IV.8) 
From this equation it is obvious that for W> œ it assumes the form 
V’ — W° =0 (IV.8’) 


and, therefore, we have an exponential solution with real indices 4,, 
Ay, AZ +23 =W?>0: 


gy ~ exp(+ A,x + A,y), (IV.9) 
For large W (larger than 4,, A,), these waves are damped very quickly 
inside the screen. Note that Klein’s paradox does not arise in this case 
and the solutions c+m cannot be interchanged. 


APPENDIX V (TO SECTION 22) 


A. The space-time structure of a nucleon may be formally defined by 
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the vector T, [see (22.3)] which depends on two variables, X= y-F(x, 
+x) and x=x,—x;: 


r, =T,(X, x). (V.1) 


The function of the two variables I, (X, x) is derived using perturbation 
theory. We compute the vertex T, for the interactions that are described 
by the Lagrangian 
W = giWysti, + Pyst} + ie 
~ Op . 0@;) > esz Aa 
x4: -7 — b2 a A, +~W(1 + 13) yAn 
Ox Ox 2 


u 


A (V.2) 
where the operators W and ý describe a nucleon field (“bare” protons 
and neutrons), the operators Q, and @, describe the pseudoscalar field of 
charged mesons, and 7,, T3 and t, are the usual isotopic spin matrices. 
The first term describes the ‘‘strong” interaction between the nucleon and 
meson fields, g is the constant of this interaction. We shall formally also 
consider it sufficiently small so that perturbation theory may be used. 
The second term describes the interaction between a meson field and an 
electromagnetic field. This field is characterised by the vector potential A ae 
Finally, the third term describes the electromagnetic interaction of a 
“bare” nucleon. For a neutron this term is equal to zero because in this 
case 4(1+73)=0. 

We now compute an element of the scattering matrix for the scattering 
of a photon on a nucleon to the first order in e, and to the second 
order in g. We must first calculate the average over meson vacuum of 
the interaction energy W, taking into account terms of the order of e,- eg., 
and eg”. Using the usual Feynman diagram method, we obtain 


<0| W 10> = | J, (x) A, (0) dix + | J, (1, X2 y) Ay) 
x d*x, d*x, d*y, ; (V.3) 
where 


J,() = - D(x) (1 + 03) yy), (V4) 


J, (Xi; X2, Y) = ieg7 (x2) [yss (x2 — X1) Y5T3 X 
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x| Detra = PHY 
oD. — 
~ D, (x; — y) aa W(x,). (V5) 
Vu 


The current (V.4) is due to the charge of the “bare” proton, and the 
current (V.5) is due to the charge of the meson atmosphere around 
a nucleon. The latter current corresponds to the Feynman diagram shown 
in Figure 22b. In this equation S,(x,—.x,) is the nucleon causal function 
defined in (14.3) and (1.16); D.(x2— y), D.(x,—y) are the meson causal 
functions. The term in curly brackets in (V.5) is, to an accuracy of ieg?, 
the vertex of the function I7,(x,, x2, y) to the second order in g. It de- 
pends only on the difference of the coordinates x,, x2, y and, therefore, 
it may be rewritten in terms of the variables Y—y and x where Y=y 
—}(x,;+.x2). Thus from the point of view of field theory, the space-time 
structure of a nucleon is described by functions of two variables. The 
difference between the point of entry x, and the exit point x, takes into 
account the effect of nucleon recoil (changing its position) in going from 
the initial state y(x,)=y;(x,) to the final state ¥(x.)=W,(x2). This 
reflects the dynamic character of the nucleon structure described by the 
vector I, (X, x). The Fourier transform of this quantity F(a, $) is a func- 
tion of the matrix y, and the vector «, 8. In the following it will be 
sufficient to limit ourselves to the terms of F which are diagonal with 
respect to the spinor indices. It is easy to show that these terms have the 
form 


F’, (a, B) = 4, (a, ap, B?) = ©, (a?, ap, B?), (V.6) 


where @ is a function that depends on the invariants «?, «8, B?. Note 
that the quantity Ñ, («, 8) would have exactly the same form if the 
nucleon were a scalar particle with zero spin. 
From particle scattering, using (22.7), we can obtain the value of the 
function Ñ, (a, 8) only for «=q and B= P: 
Č (q, P) = ©, (q?, 0, P?) = ©, (q?, 0, 4M? — q”). (V.7) 


From this we see that the spatial transformation 


Py(X) = f I’, (q, P) e'™* d°q (V.8) 
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has physical meaning only if g?7<4M? (P?~4M7). In fact, the variable 
B (= P) is coupled to the coordinate x = x2 — x1, but not to the coordinate 
X =y—4(x1+-2), and from (V.7) we see that the variable £ in (V.8) is 
a function of the variable «=q. Therefore, we must write 


p,(X) = | I’, (q, 2M) e'%* d°q (V9) 


and restrict ourselves to values of q?<4M7?. 

From this it follows that the spatial distribution p(X) has physical 
meaning only for heavy, almost immovable nucleons and for small 
momentum transfers q. 


B. In order to simplify (22.10): 
J, = Uy (pp) {aq, + bP, + cy, + doggy + Coy Py} u: (pi), 
(V.10) 


note that, from the vectors q, P, and y,, it is possible to construct only 
the scalars q°, P?, Yaqa YuPu, (the scalar qP =0). Furthermore, the 
scalar P? can be expressed in terms of the scalar q? (P?=4M?—q’) 
and the scalars y,qg, and y, P, may be removed using the Dirac equations 

ü (p) („Pa + M) =9, (V.11’) 
where M is the nucleon mass. It follows from these equations that the 


coefficients a, b, c, d, e are functions of q° only. From these equations we 
also have 


(ujlio,)P, lu) = (al — q, (ui), (V.12) 
(uy IO wy |u;) = (u,| 2My, ui P; ju). (V.12') 


These relationships allow us to remove from (V.10) the terms in P, and 
„y Py. Thus the current J, becomes 


Ji, = ui; (Py) {Aqu + By, + Co vgy} u; (p), (V.13) 


where A, B, C are form-factor functions which depend only on q?. 
Using (22.4”) we now obtain 


0 = J „q, = tty (Pp) (Aq? + Byydu + CoyrGudv} ui (pi). (V-14) 
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The factor y,q, is equal to zero because of (V.11) and (V.11’) and the 
quantity o,.9,g,=0 because the matrix c,, is antisymmetric. Therefore, 
the form factor A(q?)=0 and the current may be written in the form 


Ji, x Uy (pp) {B (q’) Yu + C(q’) Sudy} ui (p;). (V.15) 


APPENDIX VI (TO SECTION 23) 


The equation for elastic n-N scattering (23.6) was obtained in the 
following way. Let ®,=®,(k1, k2, k3,..., ks) be the wave function of a 
system of s mesons and nucleons (here k,, k,,..., k, are the momenta). 
Mesons and nucleons can be annihilated or created in this interaction. 
Thus the wave functions ®, with different numbers of particles are 
related by the equation 


(E = Ho) ®, = X Wos ®,. (VL1) 
SEs 


where Hə is the Hamiltonian of the noninteracting particles, E is the 
total energy of the system, and Wss is a matrix element of the energy of 
interaction operator. Of all the possible functions ®,, we are interested 
in ®,(k,, k2)=Ņ (k, p) where k; =k is the meson momentum and k, =p 
is the nucleon momentum. In order to compute these components, we 
first rewrite the system of Equations (VI.1) in the form 


(E = Ho) y TA D WP, , (VI.2) 
st 
E-H) Oo = Do Wee Po + Wea (V12) 
Ss +s 


We introduce the “elementary scattering matrix” 7 (see [41]): 

f = ò+ (E — Ho) W, (VL3) 
where 6* (z) is the function defined in Appendix I by (1.7). Using this 
matrix, Equations (VI.2) and (VI.2’) may be written in symbolic form 

(E — Ho) y = 0, (V1.4) 

=F +Y; (VI.4') 


and by iterating we obtain for the function w the equation 


(E — Hh) Y = ô r Y +OP). (VL5) 
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If we assume (a) that divergent terms are excluded from this equation 
and (b) that the remaining terms tend to zero as N-> 00, then this equa- 
tion may be written in the form 


{E — E(k, p)} y (k, p) = | o«oix, p) y (K', p) d°k’ d?p’, 





(VL6) 
where © (k, p|k’, p’) is the matrix element of the operator 
A Foa 
G = - Ô, (VL7) 
1-—F 
E(k, p) = E, (k) + En (p) (VI.8) 


is the sum of the z-meson energy £,(k) and the nucleon energy Ey(p). 
For real processes E= Ho and the matrix r, because the function 





1 
5* (E — Ho) = =, + ind (E — Ho) (VL9) 


0 


is imaginary, will contribute to the imaginary part of the operator 6. 
Therefore, the operator Ô is complex. This is due to the presence of real, 
inelastic processes. 

We now change over to the center of mass system in which k + p=0. 
Equation (VI.6) may now be written as 


{E — Ex(q) — E,,(q)} Y (Q) = [se qalq) y (q) dq (VI.10) 
or, in the coordinate representation, as 
{E — Ag) — A? (&«)} w(x) = | Fexim w(x’) dx. 
(VI.11) 


where A2(x) and Ẹ?} (x) are the Hamiltonians of the unbound nucleon 
and z-meson respectively. By operating on both sides of this equation by 
the operators [E — H%(x)]+ A(x) and [E + A°(x)]— Q(x) in turn, we 
obtain 


[E2 — K?] y (x)= [uc x’; E) w(x’) d3x’, (VI.12) 
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[Ex — AR 00] y (x) = | V(x x; E) yx) dx. (VLL13) 


These are the equations given in the text as (23.6) and (23.7). 

These equations may also be obtained from the relativistic wave 
function equations for the system of particles @(x,, X2,...5 Xps ++» XN). 
Here x,=t,, Xx is the time and space coordinate of the k-particle (see 
[47]). The analysis that most closely approaches this method is that for 
the simultaneous function ọ (t, x,, t, x2)=Ņ (t, X1, X2) given in [42] on 
the basis of Green’s function theory. The authors obtained an equation 
that they called a quasipotential. The advantages of this new approach 
to equal time equations are that (a) it automatically includes the necessary 
renormalization, and (b) the potential is local and so approximations 
may be written in terms of the scattering amplitude. 


APPENDIX VII (TO SECTIONS 24 AND 29) 


The interaction of fields in modern theory is described by an operator of 
the Lagrange function £(x) which is usually written as the product of 
the operators of the fields being considered. This product must satsify 
the requirements of relativistic invariance, some of the symmetry con- 
ditions, and the requirements that correspond to those of the classical 
theory, if any exist. In particular, for the interaction between the electro- 
magnetic field A , and the electron-positron field y, the Lagrangian Ê has 
the form 


= 1, $ 

È=- 9,0) Ay(x), (VIL1) 
where the current ĴÎ, (x) is equal to 

J, (x) T ey (x) YW (x) x (VII.2) 


The constant in this case is the elementary charge e. For the case of a 
scalar, electrically neutral field 6=@*, the Lagrangian is 


pacer. (VII.3) 


where å is some constant.* 


t Note that Ê~ ¢? leads to a dynamically unstable system and that L ~ ĝ? leads to trivial 
linear equations. 
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For a sufficiently small interaction constant and for a completely 
switched on interaction, the scattering matrix § 


S$ = Texpi fe (x) dfx (VIL4) 


can be written in the form of a series of functionals [see (28.11), (29.22)] 


$= y zf T [L (x,) Ê (x2) L(e,)] dêx, d4x, = dfx. 
n=0 


(VIL5) 


Because the Lagrangian L(x) is the product of the operators of the 
fields @ or w or A, each term in the series (VII.5) itself represents the 
sum of the products of the operators of particle creation a, and the 
operators of particle annihilation a,. The order of these operators which 
transform the initial state of the field Y; into the final state ¥, defines 
the history of the particles (in the Feynman diagram sense). The computa- 
tions of the matrix element S,;=<¥ A8 |Y;> which describes this trans- 
formation may be simplified if the factor in (VII.2) is written in the form 
in which all the annihilation operators are on the right of the formation 
operators. Then the annihilation part of the operators that operate on 
Y, would transform it into a functional of vacuum Yo and the formation 
part of the operators, which operate from the left of the functional ¥, 
would also transform it into a functional of vacuum Fo. Calculation of 
the effect of the “extra” operators reduces to finding their vacuum aver- 
age. 

This process of changing the position of the operators is called the 
process of transforming the 7-products by the normal N-products. The 
validity of this process is based on the easily verifiable equation (see 
(37, 38)) 


T {9 (x) PO = N {e(x) o(y)} + D(x — y), (VILS) 
which holds for operators of a boson field ọ (x), and the equation 
T Ya) P O = N (a(x) PO + ES yas (VIL6) 


which holds for operators of a fermion field Py. Note that, for the com- 
binations of operators such as (y, w) or (W, %), the J- and N-products 
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are equal. The causal functions D,(x) and S,(x) are defined in Appendix 
I and are also called contractions. The vacuum average of the normal 
product is obviously equal to zero. Thus the nonvanishing contribution 
to the matrix element S;; consists of the product of singular functions 
such as D,(x—y) and S,(x—y) and of the limiting, or “bounding” 
operators of particle production and annihilation, which act on the func- 
tionals ¥, and W,.! Therefore, in the simple case of a real, scalar field 
P(x) 

$(x) = f d?k {å (k) Ug(x) + â* (k) UŽ 0}, (VIL?) 


where U;,(x)=e*/(2@)'/?, œ=(k?+m°)™?, the matrix element S,; has 
the form 
Sji = CP, a* (k,) a* (k2)---a* te) | Bron Y2» +++ Vm) 
x Son PD; (X1, X2, +++) Xn) d*y, d*yz -+ d*y,, dfx; dx- 
x d*x,a(k,) a (k3) +a (ki) [Pid (VII.8) 
where S,,, denotes the sum of the products of all operators a* (k) Uj (x) 


and a(k) U,(x) except for the “bounding” operators which are given in 
(VII.8). The function Ê, is equal to 


ÈY, Yas e> Ym) = 3 Us (v1) UE (2) ++ UE, Om)» (VII.9) 
P 


where )’p denotes the sum over all combinations of m-particles in the 
final state. Similarly, 


®;(X1, X25 +06) Xn) = > Uy, (%1) Uy, (% 2)... Urn (Xn) (VILS) 
P 


in the symmetrized function of n-particles in the initial state. From the 
definition of the “bounding” operators, we have 


Spi = [aro Y2, tee Ym) s (yı y2; veas Ym l Xis X25- Xn) 


X D(X, X2, -<-> Xp) dy, d* yy -+ df Ym 
x dfx; dx,- dfx,, (VII.10) 
* Some of the matrix elements may diverge. We shall not describe the methods of regular- 


ization (renormalization) of such elements here. These methods are given in special courses 
on quantum field theory. 
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where 


s (yi, Y2»---» Ym | Xis X25: Xn) = (Pol Sim |Po> (VIL11) 


is the vacuum average of all combinations of particle formation and 
annihilation that describe the history of the field in the interval t} = — co 
to t, = œ in which the state changes from the initial i to the final f. 

From this short outline, it follows that the scattering matrix S may be 
written in the form of a series of functions 


Sn = S (X1, X2, za Xp (VIT.12) 


each of which represents the vacuum average over the n-particle history. 
The division of the particles into those in the initial state (x) and those 
in the final state (y) is not important, as we can see from (VII.9). Each 
function in (VII.11) can represent various physical processes depending 
on what divisions of the initial (7) and final ( f) particle states are pos- 
sible. The function s(x,, X2,...,X,-1) can represent the processes 


sYa | Xis X25 +++) Xn=1)> 

S(Y1 Ya | X1, X29 -1 Xn- 1> Xa—2) | (VIL13) 

5 (V1 V25- Yn-1 | Xn)- 
Not all the possible divisions will describe real, physical processes. For 
example, the first division in (VII.13) would imply that a particle y is in 
a resonant state (n—1) of the particles (x,, X3,..., Xn-1), which con- 


tradicts the spectral decomposition (VII.7) in which only one type of 
particle, which is stable and has mass 7n, is assumed. 


APPENDIX VIII (TO SECTION 27) 


A. Virtual photons may not be considered separately from their source. 
We assume that the source is a particle that is localized near the point 
x=0, and that a quantum jump takes place at a time close to t=0. The 
current of particles formed in such a transition is 


JP = | d°p; d°p, exp[— 3a [(p3 — Py) + (Pp, — p)7 J] 
x exp[— i(kx — kot)] a, (p3) F,u;(p1). (VIIL1) 
The field of the packet of virtual photons that are formed in this transition 


300 SPACE AND TIME IN THE MICROWORLD 


is described by the equation 


1 
A,,(x, t) = Gam J, (x) 
u;F Wu; 
an d? d? f? ui 
| Pa $ Pi ie +m?) 


a? 2 2 
x exp] — 5 [(p; — P)? + (Pp: — pò?] 
—i(kx — kon |. (VIII.2) 


In order to compute the integrals in (VIJI.2), we set 


Pi=S+DP;, P3=nN+P,y, (VIIL.3) 
so that 


Ps — Pı =M- E) +(p;— p), ka (py — p), 
Pos — Poi = E (p3) — E (p4) 
= E,—E,+ VEm- VEE +, 
koz E-E, VE =u, VẸĘ=w;. 


(VIII.3’) 


We consider the packet to be small, but |q,;|=|p;—p;|>//a. Further- 
more, for simplicity, we assume that the radiating particle is a point 
particle, i.e., its mass m> |q,;|. Then from (VIII.2), the potential of the 
packet is 


A, (x, t) © exp |: (kx — kot) — = (x — uyt)? + (x — w| 


a? 
(VII.4) 
This expression was used in the text for the case when u;=0, Uş =u. 


B. Note that the integral 


+c 
I= | | e ?&9 dx dt, (VIIIS) 


where 
® (x, t) = A?x? + Bxt + C7t? + Dx + Et +G, (VIIL6) 
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for 47=4A?C?—B?>0, is equal to 


path 
A 
where 
A? C? 
e 2 2 
Porat pP See ee 
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(VIII.7) 





where 


i, 2 at jai 
Aaah, 


1/ve+v? wv? 
ce-a toa 
v,;-v, u 
s--( p? +4), 


1 
D =i(k—q)+—(2X — uT), 
a 





1 
E = — i(kg — qo) + GLX — uT, u)]. 


(VIIL9) 


(VIII.10) 


(VIII.10’) 


(VIII.10") 


(VIII.10”) 


(VIII.10””) 


and where qo is the difference between the fourth components of the 
vectors p, and p,. For the particular values of 4, A, C, B; 


1 
= {Dept v? + u?) + (ve + v; — u)?], 


2 


a?’ 


A? = 
C= 1 2 2 2 
== zas tv +u’), 


1 
B =— -z (Ys +V; +u), 


(VIII.11) 


(VIIT.11’) 


(VITT.11”) 


(VIIIL11”) 
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we obtain 
X? (X-uTP CŒ 
Us 5a a typ XT 
A? >, B 
tag X uT, u) + -zza CX — uT) (X — uT, u) 
C? A? 
-P (k — q}? — yE (ko — qo) + Im ¥, (VIII.12) 


where Im ¥ denotes the imaginary part of ¥. Because ko =qo Im ¥ =0 
for k=q, then Im ¥ may in general be neglected in (VIII.12). 

The real part of ¥ is a positive-definite quadratic function of the 
variables X and T. We write it out for the special case for a=b, X, =0, 
X,#40, T=0 [here X¥,=1/u(Xu)]; 


X? 1 (v; +v; — u? 
Y=- d =2a|1 +- ], (VIL13 
d? ý | "30a ow) ( ) 
for X,=0, T=0 
2 $ 
Y=- Z, (VIIL13') 
for X=0, T#0, 
272 
pa 7 (VIIL.13”) 
where 
Dp eo 8 2 
p 2220r + ut) + (v; +v; +u) (VIIL14) 





2(v7 + v? + u?) + u? 


APPENDIX IX (TO SECTION 28) 
Let ¥ be a functional of some function which is defined in the space 
R(x) 
Y = Y {u(x)}. (IX.1) 
We vary the function u(x) about the point x such that 
u’ (x) = u(x) + ôu (x) 
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where the variation ĝu(x) differs from zero in the neighborhood of Xo. 
If the new functional can be written in the form 


P = Y {u (x)} = Y {u(x)} + [ruw du(x') dx’, (IX.2) 


then the limit 
SV m PAO) ¥ {ul} 


u(x) — guax+0 du (x) dx 





= F {u(x)} (Ix.3) 


is called the functional derivative of the functional ¥ with respect to the 
function u(x) at the point x. Another, more formal definition is 
ov P (w 0} — Y {u(x} 


= li ; IX.3’ 
du (x) ii E ( ) 





where ôu(x')=£sô(x— x’). In this definition, the localization of the varia- 
tion about the point x is clearly shown. Obviously, the two definitions 
(IX.3) and (IX.3’) are equivalent. 
If u(x)=a(x) is a space-like surface, then 
oy. Y {o'(x)} — ¥ {a(x} 


l 
ôo (x) Pa ôw 





(IX.4) 


where ôw = ôo (x) d3x is a four-dimensional volume contained between 
the surfaces o’(x) and a(x) (see Figure 30). 
If the functional ¥ {u(x)} is defined by the series 


1 
Y {u} = Yo (x) + ave x2) u (x2) dx3 
1 
T x | ves x1) u(x,) dx, 


1 
+ 31 | W(X, X2, X3) u (x2) u(x3) dx, dx3, (IX.5) 


then its functional derivative at the point x will be 


Ov {uj í f : 
buy ” ET W (x, x2) u (x2) dx, 





1 
+5 | von. uce) dx, 
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1 

+ a lve X2, X3) u (x2) u (x3) dx, dx3 
1 

+ a | ve xX, X3) u(x,) u (x3) dx, dx3 


+ = [ves X2, X) u (x1) u (x2) dx, dx, ++. 
3! (IX.6) 


APPENDIX X (TO SECTION 31) 


The Relativistic Wave Function for Two Particles 


We shall investigate the wave function for two free particles which are 
described by the plane waves 


Y (X1, t1; X2, t2) = exp [i (P:x; — E,t,)] exp [i (P2x2 — E,t»)]. 
(X.1) 


This equation has physical meaning only when the points x,, t,, and x3, 
t, lie on a space-like plane. We introduce the coordinates 


z ee = ee } (X.2) 
where a, B>0, «+ f=1. Then 

R 3) 
The total momentum is 

P= pP; + po, E = E, + E. (X.4) 
In terms of these variables, the wave function (X.1) becomes 

W(X, T, x, t) = exp [i(PX — ET] + i(px — £t], (X.5) 
where 

P = Bp, — ap,, e = PE, — ak, (X.6) 


are the relative momentum and energy. 
As the points (x;, t,) and (x3, t2) lie on a space-like plane, we may 
set ¢; =f). Then t=0 and T is the time common to both particles. In the 
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center of mass system, P =0 and so, for t; =t,, we have 

y (X, T, x, 0) = exp [i (px — ET)]. (X.7) 
where the quantity 

E = E(p) = m +p + m +p (X.8) 


is the total energy of the two particles in the center-of-mass system. We 
now change notation and use ¢ instead of T, x as the relative coordinate of 
the particle and ż as the common time. In this notation, the expansion 
of the function y in terms of spherical waves has the form 





y= 5 (21 + 1) P, (cos 9) R; (pr) e7 EP! (X.9) 


but that for zero-spin particles does not differ from the nonrelativistic 
case. In this equation r is the distance between the particles in the 
center-of-mass system 


Ji 
R, (pr) = Ji +u (Pr) (X.10) 


./ pr 
Ji 41/2 (2) is the Bessel function and P,(&) the Legendre polynomial. The 
asymptotic form of equation (X.9) is 
w(x, t)= } (21 + 1) P,(cos v) [a (E) exp (i(pr — Et)) + 
1=0 
+ b,(E) exp(— i(pr — Et))], (X.11) 


APPENDIX XI (TO SECTION 34) 


Resonance States. Nonrelativistic Theory 


We shall consider the scattering of two particles that can have a resonant 
interaction. For definiteness, we assume that the resonance takes place 
in the s-state.t We write the wave function of the s-state in the form 


Et 


1 
g(r, = we) exp( - i=). (XI.1) 


t The calculations for other states are analogous. 
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The function wy satisfies the equation 


d? 
a + [k — k (r)] y =0, (XI.2) 
where 
2mE 
pol. en ee. (X13) 





h2 2, h2 
E is the total energy, k is the wave vector, m is the reduced mass of the 
interacting particles: 1/m=1/m, + l/m, and V is the potential energy 
of interaction in the s-state. 

We define a special form of V(r) in the following manner (R is the 
radius of the sphere of influence): 





k?(r) =0 for R<r, (X14) 

3 2 2m% , 
k*(r) = kp = ie for a<r<R, (X14) 
k?(r) =0 for O0<r<a (X1.4”) 


(see Figure 50). This form of V(r) allows for the existence of resonance 
levels in the continuous spectrum. 





Fig. 50. The form of the potential V(r) that gives a resonance for an energy of E= Ep. The 
dotted line represents the wave function y for E= Er. 
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The solution of (XI.2) may be written in the form 


yı = alk -R) 4 pe“ kO-R) for R<r, (X15) 
Wy = aeti ETN 4 Be~40-R) for a<r<R, (XL5’) 
Win = Y Sin kr for O<r<a. (X1.5”) 
Matching the solutions on the boundaries r= R and r=a gives 
at+b=a+B, 
a ae for r=a, (X1.6) 


ae~4 + pe^ = y sinka, 


k fi =R, X16’ 
ue“ — Be“ =~ y cos ka ey ( ) 


where A=4(R—a)>0. For Ao (R>œ), we obtain from (XI.6) and 
(XI.6’) the condition 


k 
D,(k)=1 +5 ctgka =0, (XI.7) 


which defines the discrete, steady-state levels for the case of an infinitely 
wide barrier. 

For finite values, quasi-stationary resonance levels arise [28]. In order 
to compute these levels, we write the system of equations (XI.6) and 
(X1.6’) in the form 





2a = Aa + A*b, Ma+M*b=4ysinka, (X18) 
k 

2B = A*a + Ab, Na + N*b =4y7 F coska, (XI.8’) 

where 
ik 
Azi T M = Ae74 + A*e^, N= Ae7^ — A*e^. 
(XI.9) 

From (XI.8) and (X1.8’) we have 

$ A" e72 D_— D, 

= (XI.10) 


e724 D_ + D, 


>| > 
ajala 
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where 
k 
D_(k)=1- 7 ctg ka. (X1.10’) 
Let kg be one of the roots of Equation (XI.7). Then 
dD(k 
>, =| SO (k — kp) +, (X1.11) 
dk Irate 
a) ARE A (kp) (X111) 
dk dein ke l 
where 
k k 2? 
ad= i)o. (X112) 
AR Ar kg 


Therefore, close to the resonance Equation (XI.10) takes on the form 
a A [k— (kr + ô) — ir] 





mae X1.13 
b A* [k — (kr + ô) + ir] ( ) 
where 
A _,, D_(kp) 
T = Im E ert4k, Pw], (XL14) 
D_(k 
be Re| 4 eo 24ka -w| (X115) 
A* A 


Note that close to the resonance, the amplitude of the wave function in 
the inner region y tends to a. 

In fact, close to the resonance 
y 4(A*? — A?) 4(A¥* — A’), 


a sinkpa[M*+N*]  sinkga 
so that for A—>œ and for Ex Ep, y/a—>æ. Particles accumulate inside 
the potential barrier, in the region 0<r <a. 

We now rewrite the solution y, (kr) for r>R in the form of the 
superposition of the initial s-waves y?=(sin kr)/kr and the scattered 
waves u= fei’ /r: 


p= tu. (X1.17) 





AteA, (X1.16) 
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Simple computations using (XI.5), (XI.9) and (XI.13) give 
e FR k —idpk — kg —ilp 
Qik kK+ilgk—Kkg+il,’ 





f= (XI.18) 
where 

Rp = kp + Ôr- 
The scattering matrix S which transforms the incident wave e~ *" into the 
scattered wave e!*" has, from (XI.17) and (XI.18), the form 


k—idlpk—kp—il'p 
k+idgk—kypt+ilp 

The first factor contains the factors that violate causality. This viola- 
tion is due to the fact that the radius of the sphere of influence of the 


potential V(r) is finite. The second factor describes the resonance about 
the point k = kp. 





S (k) = e7? (XI.19) 


(a) The analytic properties of the solution in the complex plane E. By 
considering k? =2mE/h? and assuming, for simplicity, that 2m/h? = 1, we 
may write the solution y, [see (XI.5)] in the form 


Wn (kr) = a exp[— y- E(r — R)] + bexpL./— E(r — R)] 
(X1.20) 


We set 
E = se’, e = |E], e"? = |B), (XI.21) 


On the first, “physical,” sheet of the Riemann surface, we define 
J-E =e!" exp[ż ilo — 7)], 
Re /- E ="? sintg > 0. (X1.22) 


On the second, ‘“‘nonphysical”’ sheet of this surface, we define E by the 
equations 


1/2i(@+ 
J- E= elie 7 


Re, /— E =- e"? sinjo <0 (X1.22’) 
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and we make a branch cut from E=0 to E+ œ which separates the two 
sheets (see Figure 51). In accordance with (XI.5), (XI.20)-(XI.22), the 
wave vector k is defined thus: 








Fig. 51. The complex plane (E). This figure shows the path from a point on the first sheet 
(the cross) to a point on the second sheet (the black dot). 


on the physical sheet 
ik = — J-E= + ie! 2 exp(i s), (XI.23) 
k = "°? exp ( s) Imk >0; (X1.23') 
and on the nonphysical sheet 


ik =~ /ZE =- jg"? exp(i $), (X1.24) 


k = — e"? exp (: s) Imk <0, (X1.24’) 


Thus the physical sheet of the surface (E) is mapped onto the upper 
half-plane of the variable k, and the non-physical sheet is mapped onto 
the lower half-plane. Note that in the stationary state, the component of 
the current that is directed along the radius vector must be equal to zero. 
Therefore, 


b(E) = a(E*). (X1.25) 


Furthermore, from the symmetry of the upper and lower half-planes of 
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(E), we have 
a(E*) = a* (E). (X1.26) 


We first consider the stable levels which, in the preceeding example can 
occur for 4=0 (Ro). On the first sheet we have 


y (r, t) = [a (E) exp (— ¢'/?r) + b (E) exp (e'/?r)] exp(— iEt). 
(X1.27) 


The second solution increases without limit for roo and, therefore, we 
must impose the requirement [the characteristic solution of (XI.2)] that 


b(E) =0. (X1.28) 


This equation defines the stable levels which are defined by zeros in the 
functions on the surface (E) which lie on the real poles E<0. 
Note that on the second sheet, for E<0, we would have 


y (r, t) = [a (E) exp(e!/?r) + b (E) exp(— e"/’r)] exp(— iEt), 
(X1.29) 


so that the levels would be defined by the condition 
a(E)=0. (X1.28’) 
Because of (X1.25), Equations (XI.28’) and (X1.28) give the same result. 


(b) Unstable levels. We consider the levels that are possible for 
complex values of E when Re E>0. We again require that 


b(E) =0. (X1.30) 
Let 
E=E,-iI, (X1.31) 
so that 
exp (— iEt) = exp(— iEgt — It). (X1.32) 


On the physical sheet we have 
ik = — ./—E=— e"? exp[$i(y + 27 — 7)] 


= — 1? exp[4i(y + 7)] = — ie’? ep(i z), (X133) 
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where 
T 
y = — arctg — + 27n, n=0,1, (X1.34) 
Ep 
from which 
k=-k —ix’, (X1.35) 
, 1/2 y T 1/2 wn? , 
k =e cos 5, x =—eE sin 5> (X1.35) 


so that k’>0, x'<0. 
Thus on the physical sheet, the function would have the form 


W (r, t) = a (E) exp[— ik’ (r — R) + x' (r — R)] 
x exp(— iErt — It). (XI.36) 
This solution contradicts the continuity equation because wW(r, t) de- 
creases as r(x’<0) and ¢ increase. 


We now turn to the solution on the second, nonphysical sheet. In this 
case 


k= (- E)" ANER iet/?e i@/2 k=- g1/2eie/2 


and @=y+2n, where y<0, so that kK=k’ + ix” 
wm F y We AA 9 y 
k” = ¢’/* cos F x” = "4 sin 7 (X1.37) 


k">0, x" <0, (X1.37') 
For r> R, the wave function has the form 
yý (r, t) = a (E) exp(ik’r — x"r) exp(— ipt — Tt). (X1.38) 


This solution increases as r(x” <0) increases, and decreases as t increases. 
By assuming — x” = + |x”|, we obtain 


y (r, t) = a(E) exp [i (kgr — Ert)] exp(x"r — It). (X1.38') 
This solution is compatible with the continuity equation and is, therefore, 


considered correct [138]. 
This crossing over from the first to the second sheet of the plane £ is 
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easily illustrated using Figure 51. The zero point E= Ep—il’, indicated 
by a cross, lies below the branch cut. If we remain on the first sheet, 
we can only get from the point E=E,+iW to the point E=E,—il by 
going around the branch cut from the left. The phase of the wave vector 
changes by ef" when we do this, and the converging and diverging waves 
change places. In order to satisfy condition (XI.30) and the earlier de- 
finitions of the converging and diverging waves, we must cross over onto 
the second sheet directly across the branch cut OE. 


(c) Space-time properties of the resonance levels. We define the 
initial state y? in the form of a localized wave packet 


yo = | c(k) = exp[ — iE(k) t] dk, (X1.39) 





sin k 
kr 


which is concentrated around the value k=k,g.' For definiteness we set 


c(k)=c for a 


c(k)=0 outside of this intervals . (X1.40) 


The wave packet (XI.39) now has the form 








c : 
Wr (r, t) = — — exp [i (krr — Egt)] f(r — vt) 
2ikgr 
~ ——exp[—ilker + Eg] f(r + vt), (X141) 
2ikpr 
where the function f (r + vt) differs significantly from zero near 
h 
rtvt= A’ (X1.42) 


We assume 4>R. Then the collision occurs at a time close to 
h 
eae 


We now turn to the scattered wave u, (kr, t). According to (XI.17), (XI.18) 
and (XI.39), it has the form 


(X143) 


t We assume that kp 1s not too close to the origin. 
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kr+å 
u,(kr, t) = e c(k) exp[i(kr — Et)] g(k) 
kr~d 
ee ey 


By replacing the continuous functions, g(x), by their values at the point 
k=ke, we obtain 
c exp [i (krr — Ert) — It] 





u, (kr, t) = a g (kp) 
R 
+A 
: ¢- Ww ; 
x | exp [i(r — vt) č] Pair dé. (X1.44’/) 


-A 
For 4-00, we close the integration contour in the lower half-plane for 
r—vt<0. We obtain 


u, (kr, t) = as exp [i (kgr — Ert)] (— 2il) 
R 
x exp[—i(r — vt) ir] 
= a exp [i (krr — Egt)] exp [T (r — vt)]. 
R 


(X145) 


If r—vt>0, then we can close the integration contour in the upper half- 
plane and obtain 


u,(kr,t)=0, for r—vt>0, (X1.45’) 


which agrees with the results of Section 34. 


APPENDIX XII (TO SECTION 35) 


The unitarity relation for the scattering matrix S=/+iT may be written 
as 

$§+ =1+if -—iT*+ +TT* =1 (XIL1) 
or 

ImÎ=4fÎ+. (XII.2) 
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This operator equation may be written in matrix form as 


Im Toy =} F Tey Te 
Y 


ay “aß ` 


In particular, noting that T,, = T%, for the diagonal term «=f we have 


Im Ta =) Tal. (XII.3) 
7 


We apply this equation to the scattered particle. Then T„, denotes the 
“forward-scattering” amplitude (i.e., the scattering in which the particle 
does not change state), and the sum of terms |T,,|? denotes the total 
probability of a transition from an initial state « to all other states y. 
Therefore }' | 7,,|? is proportional to the total scattering cross section 
o,. A more detailed calculation gives the result 


k 
Im T (0) = —a,, (XII.4) 
4n 


where T(0)= T,,, and k is the particle momentum in the center-of-mass 
system. Equation (XII.4) is called the optical theorem. It is a logical and 
necessary consequence of S-matrix theory in that it expresses the con- 
servation of probability: the departure of particles from the initial state a 
is compensated for by their appearance in other possible states y. The 
unitarity condition (XII.2) is not as necessary as (XII.3) which leads to the 
optical theorem. However if we give up the unitarity condition, we loose 
the mathematical basis for (XII.3). Thus giving up or violating the 
unitariness of the $-matrix leads to a difficult situation. 


APPENDIX XIII (TO SECTION 37) 


In order to compute the elastically scattered wave Y., we start from 
Equation (35.10) where the function G is defined in terms of the function 
Go, (37.13). We begin by considering the first case; the second is simpler 
to compute. We write the four-dimensional 6-function in the form 


ôt (pa + P3 — P2 — Pi) = f d*a expia (p4 + Ps — p2 — P1) 
(XIIL.1) 
and the function D,(g) in the form 
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D.(q) = | d*BD, (B) +, (XIII.2) 


where q=p,—p, OF pa +p. Inserting these expressions in (37.13), dif- 
ferentiating (37.13) with respect to 6/0x and ô/ôx?, and using (35.10) we 
obtain 


Pout (X4 x3) = Uz (x4) uy (x3) +i | dta d*B 


x expi[a(p4 + Ps — P2 — P1) + PaXa 
+ 3X3 — P2X2 — P1X1] 
x {D,(B) exp [iB (P3 — p:)] 
+ D, (P) exp [iB (p2 + Pi)]} 
d?p, d°ps 


x 
2p9 2p$ 








d*p, d*p2u, (x2) uy (x1) d?’x, d*x,, 


(XIIL.3) 


where we assume that the original wave ¥,,,(x2, x,) represents two wave 
packets u,(x,) and u,(x,). We first compute the integral over x, and 
obtain 


it; (P1) 


(XIII.4) 
2p! 


| u, (x,) exp(— ip, x;) dx, = 


Using (35.4), we now integrate over p, and obtain 





| oe exp[— ip, (a + B)] d’p, =u, (~a — $) — (XIILS) 
1 


for the first term in the curly brackets, and 


f "ey exp[— ip, (« — B)] d°p, = u, (B — a) (XIIL6) 
2Po 


for the second term. We similarly integrate over x, and p>, and finally, by 
integrating over p3, we obtain 


3 
[ex [ips (@ + B + x3)] Se = D, (& + ß +x) (XIII.7) 
3 


APPENDICES 317 


and 
3 
| exp [ips (a + xs)] oat = 
P3 
The integration over p, is carried out similarly. By assuming that, after 
integrating the first term in the curly brackets, «= — x, a+ f= — y, and 
after integrating the second term, «— p= —x, («a+ 8)=—y, we obtain 
the final form 





D, (& + x3). (XIIL7’) 


Pout (X4, X3) = Ug (x4) Uy (x3) 


+1 | d*x d*yD, (x4 — y) Dy (x3 — x) D. (x — y) uy (x) u2 (y) 
+i | d*x d*yD, (x. + =) D, (= a ”) 
2 2 
x — 
x o.( 
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. ) u, (x) uz (x). (XIIL8) 


The computations of the function 4*(t—r, A, M) for Aco will be 
reduced to integrals found in tables. When we set 2m?=M?, t—r=e, 
x=me, the integral 


œ 


eive 
A* (e, ©, M) = M? | _.——d : 
(e ) fz ae (XIV.1) 
0 
may be rewritten in the form 
d? 
A* (e, ©, M) = 6° (e) + m 7G 1(), (XIV.2) 


where 


eizx 
I (x) = | (241) dz 
o 


1 z+i . z—i , 
= izx d 12x d 
E í {ie zl 
0 0 


= 5 [- e7*Eti(x) + eEi(— x)]. (XIV.3) 
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where 
e7t 
Ei(—x)=— (= dt, (XIV.4) 
E*i(x) = Ēi — in (XIV.4’) 


These formulas may be found in tables.‘ 
The asymptotic expansion of these functions has the form 














y; Err 1 2! 
Ei(— x)=- l--+— 547--"], (XIV.5) 
x x x? 
oe 1 2! 
Brig) =—in+ fi riie] (XIV.5’) 
x x x 
Therefore, the integral J(x), for x— œ, is equal to 
2 2! 1! 
I(x) = e [1 - nf =el (XIV.6) 
2 x x x 
and, therefore, 
di (x) n 1 
=-e *+0O{—]. XIV.7 
dx? 2° = x3 ( ) 
Using (XIV.2) we obtain 
At (e, o M) = 6° @) + 0( as) + erm. (XIV.8) 
me 2 
Remembering that ô* (e) is defined as 
1 
5* (6) = 5(6) + — (XIV.9) 


[see (I.9)], when scene ?_,00, the last two terms in (X.9) become 
small corrections. Recalling that m=M/(2)*/? = 1/a(2)'/? and assuming 
that e=1—r >a we are led to the result 


1 
a(tmr o=) 
a 


=n sof 2 


alts oj exp(- a, 


2 





(XIV.10) 


t E. Jahnke and F. Emde, Table of Functions, Section V, Dover, New York, 1945. 
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For t—r<2—a, the same equation is obtained except that (t—r) is 
replaced by —(t—r). Then from (XIV.1) it follows that for t—r—0, the 
function A* (t—r,oo, 1/a) is regular and tends to the constant value 


a(a 22) ov (XIV.11) 


a Aa 
APPENDIX XV (TO SECTION 43) 


Equation (43.7) may be derived in the following manner. Let the 
S-matrix, in the coordinate representation, be given by the usual ex- 
ponentials, ordered in time 


S=T exp(i f Lo» ax), (XV.1) 


where L(x) is the Lagrangian and T denotes the ordering in time (cf. 
Appendix VII). 
By assuming 
S=f+iT, (XV.2) 
we obtain from (XV.1) 


feo} 


T -) [rianta dx, --- dx, 
s! 





_ yo} Olti — ty) --- O(t,_, — t) Ê (x4) E(x) 


x dx, ndu =Y fh (XV.3) 
s=1 

We write (XV.3) in its full four-dimensional form by replacing 0(t) by 
the invariant functions 6(nx) where 


nx=ngt—nx, with n?=1, m >00. (XV.4) 


t In this appendix we follow references [100, 101] 
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The result is 


X Î, = Zr foo Xı — X2): O(n, Xs-1 — X;) 
s=1 
x £ (x1) E(x, dx, + dx,. (XV.5) 


As is well known, the dependence of 7, on n in (XV.5) is purely fictitious 
because, for (x;—x,4 1)? >0, 0(t;—1;4 1) is always equal to 0 (n, X;—Xj41) 
and for (x,—x,41)?<0 the function 6(n, x;—x;41) does not contribute 
because the “locality” condition 


[L (x), E(x;-1)]- = 0, (XV.6) 


is satisfied in this region. 
For definiteness we set 





L(x) = 9:¢°:, (XV.7) 
where œ is a scalar field, and apply Fourier transform (XV.5) assuming 
1 i, eit (nx) 
O(n, x) = — — dt, (XV.8) 
2ni t — ie 
Lip = feer (x) dx 
g Lan i 
= < fow — ky — k — k3):@(k,) ô (k2) 
af 2n 
x @(k3): dk, dk, dk;, (XV.9) 
where 
z 1 EN 
ĝ(k)= Ow fe kxĝ (x) dx. (XV.10) 
The result is 
i = E0), (XV.11) 





A 1 A dt 3 
2=— L(— nt) — L(nt), (XV.11') 
t™— le 


APPENDICES 321 








a 2 s dt, 2 dt, 
T. = — L — = L nT, — At K 
* (2x87! | ae tı — le We J7 — ie 
diis 
oe ST ress he (XV.11") 
Ts-1 S18. 


Equation (XV.11) can be considered as a successive iteration on some 
linear integral equation 





z 2 1 3 d a 
Rm) = Êm) = — | Lins — nv’) —— Rin’) (XV.12) 
2n T — ie 
with the condition that 
T= >} Îĵĝ = RO). (XV.13) 
s=1 


Equation (XV.12) is the invariant equation of motion for the scat- 
tering matrix in the p-representation. Using (XV.12), we can for example 
show that the -matrix is unitary. In order to do this, we rewrite (XV.12) 
in the form 


R(n) = Êm) + | Linc’) P(e — vy de’, (XV.14) 
where 

. 1 R(nt — nr 

pee ee. (XV.15) 

2m t=T — ie 

Remembering that the Hermite conjugate is 

È (nm) = lend, (XV.16) 
the conjugate of Equation (XV.14) may be written as 

R* (— n) =Î (nt) + [a (T — 1) Ê (nt’) dt’, (XV.17) 
where 

= 1 R+- 

Ê+ -= SE (XV.18) 


Qn te —tt+ie 
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From (XV.14) and (XV.17), it follows that 
R(nt) — R+ (— nt) = fo) F(t —1') dr 
— f Ft — 1) Ên’) dr. (XV.19) 


On the other hand, by using these equations after simple transformation, 
we can show that 


k (= nr’) F(t — 1’) de’ — G (c' — 1) Ê (nt) de’ 


L (nt') F(t — t’) dt 


h 


2 fe œ — 1) L(nt) dr + 4, (1) — $,(0), (XV.20) 
where 


ĝ (t) = | Ft (r — 1") Le) F(t — 1") dt’ dt”, (XV.21) 
@,(t) = | Ft (c” — 1) Lene) Fe" —v)dt'dt".  (XV.21') 


We now successively replace the integration variables in (XV.21) by 


tase, t-n in 4, (0), (XV.22) 
t—ton, te—-t7é in (2). (XV.22') 


and as a result obtain 
ê, (1) = $2 (1) = [ro (n, t + & — n) F (n) dé dn. 
(XV.23) 
From (XV.19) and (XV.20), using (XV.23), it follows that 


R (nt) — R* (— nt) = ft- nt) F(t — 1') 
— F (r — 1) R(nt’)] dv’ 


or 
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R(nt) — R* (— nt) 








SRA ae — [Rt (— nr’) È (m — nv) 
Qn J t —ié 
+ Rt (nt! — nt) R(n7’/)]. (XV.24) 
Assuming that t=0 in (XV.24) and using the equation 
1 1 
— = P-+ ind(t), (XV.25) 
t— ie T 
we obtain 
R(0) — R* (0) = iR* O ÊO, (XV.26) 


i.e., the condition that the matrix S=1+iR(0) is unitary. Equation 
(XV.24) may be considered as the condition that the $-matrix is unitary 
for t40. 


APPENDIX XVI (TO SECTION 47) 


According to (38.14’) and (38.14”), the additional terms ¥, and ¥, have 
the form 


2 
@ x 
Y= MP ee p(w) d, (iM), (XVI.1) 
— 0% ~ r 
Y, = ate MPa 2? d; (iM). (XVI.1/) 


In this form the equations cannot be easily compared to observations 
because the real part, d and the imaginary part, a of the function N are 
taken at the point w=iM. 

In order to evaluate the functions ¥, and ¥, we can use the analytical 
amplitudes in an angle from 0° to 90° and with a large magnitude M. 
In order to do this, we consider the function 

f(z) = oy may. (XVI.2) 
Zz Zz 
where 


(XVI.3) 
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On the real axis, for z=w-—>0o, the function f(z) tends to a constant 
value which we shall denote by ix. This assumption is equivalent to as- 
suming that the total asymptotic cross section ø, is constant. In par- 
ticular, according to the optical theorem [see (XII-4)] 


oO 
Rea, =—. (XVL4) 
4n 


From (XVI.2) and (XVI.3), for |z|=V,V—-+00, using Lindeléf’s 
theorem [63] we have 


AVN) NV) 








{[WY= 7 E ioe (XVL.5) 
and for z= +iV 
f(iV) = a = ia. (XVI.6) 
Thus 
N (GV) = (i N(V) =—iN(V). (XV1.7) 
and, therefore, 
d(iV) = ReN(iV) =+a(V), (XVI.8) 
a(i V) = ImN(iV) =— d(V) (XVI.8') 


for large V. If the characteristic frequency is large enough so that the 
asymptotic value of f (z) is reached for V= M, then we may set 


d(iM) = a(M), (XVL9) 

a(iM) = d(M), (XVL9’) 
or, remembering that p(A/) =3, we obtain 

3 d(iM) = A(M), (XVI.10) 

7 a(iM) =— D(M). (XVI.10') 


Therefore, we can rewrite (38.13) and (38.13’) in the form 


w? — m? 


Po) = F n 


p(w) [42 (M) + A? (M)], (XVI.11) 
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G — m? 


TONS aa ae 





Slo) = [D? (M) + D? (M)], (XVI.11’) 


or, by expressing the amplitudes 4% in terms of the total cross sections 
AS. (M)=[k(M)/4z] c+ (M), we obtain 


Y, (œo) = p(w) [o+ (M) + o- (M)]. (XV1I12) 





4n /M +m? 
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